DIFFERENTIALLY EXPRESSED GENES INVOLVED IN CANCER, 
THE POLYPEPTIDES ENCODED THEREBY, 
AND METHODS OF USING THE SAME 

The present application claims priority under Title 35, United States Code, §119 to 
United States Provisional application Serial No. 60/422,176, filed October 29, 2002, 
which is incorporated by reference in its entirety as if written herein. 

The present application contains a sequence listing in computer readable form under 
Title 37, § 1.53 (e)(5) entitled 01040_l.ST25.txt (created 10/24/2003, 2,345 KB), 
which is incorporated by reference in its entirety as if written herein. 

FIELD OF THE INVENTION 

[001] The invention relates generally to the identification of nucleic acids and their 
encoded polypeptides, whose expression is modulated in cancer or tumor cells. These 
nucleic acids and proteins may not have previously been identified as having a 
biological role in cancer. The invention further relates to methods useful for treating 
or modulating cancer or tumors in mammals in need of such biological effect. This 
includes the diagnosis and treatment of oncological disorders. Additionally, the 
present invention further relates to the use of antibodies against the polypeptides of 
the present invention as diagnostic probes or as therapeutic agents as well as the use 
of polynucleotide sequences encoding the polypeptides of the present invention as 
diagnostic probes or therapeutic agents for the treatment of a broad range of 
pathological states. The present invention also relates to antisense molecules. 

BACKGROUND OF THE INVENTION 

Cancer background 

[002] The present invention relates to methods and compositions for the diagnosis, 
prevention, and treatment of neoplastic cell growth and proliferation, i.e., tumors and 
cancers (e.g., colon cancer) in mammals, for example, humans. Specifically, genes 
which are differentially expressed in tumor cells relative to normal cells are identified. 
Among these are certain Incyte (Palo Alto, CA) unique genes. 
[003] Malignant tumors, i.e., cancers, are the second leading cause of death in the 
United States, after heart disease (Boring, et al., CA Cancer J. Clin., 43:7, 1993), and 
develop in one in three Americans. One of every four Americans dies of cancer. 
Cancer is characterized primarily by an increase in the number of abnormal, or 
neoplastic, cells derived from a normal tissue which proliferate to form a tumor mass, 
the invasion of adjacent tissues by these neoplastic tumor cells, and the generation of 
malignant cells which spread via the blood or lymphatic system to regional lymph 
nodes and to distant sites. The latter progression to malignancy is referred to as 
metastasis. 

[004] Cancer can result from a breakdown in the communication between 
neoplastic cells and their environment, including their normal neighboring cells. 
Signals, both growth-stimulatory and growth-inhibitory, are routinely exchanged 
between cells within a tissue. Normally, cells do not divide in the absence of 
stimulatory signals, and, likewise, will cease dividing in the presence of inhibitory 
signals. In a cancerous, or neoplastic, state, a cell acquires the ability to "override" 
these signals and to proliferate under conditions in which normal cells would not 
grow. 

[005] Tumor cells must acquire a number of distinct aberrant traits to proliferate. 
Reflecting this requirement is the fact that the genomes of certain well-studied tumors 
carry several different independently altered genes, including activated oncogenes and 
inactivated tumor suppressor genes. Differential expression of the following 
suppressor genes has been demonstrated in human cancers: a retinoblastoma gene, 
RB; the Wilms' tumor gene, WT1 (11p); a gene deleted in colon carcinoma, DCC 
(18q); the neurofibromatosis type 1 gene, NF1 (17q); and a gene involved in familial 
adenomatous polyposis coli, APC (5q) (Vogelstein, B. and Kinzler, K. W., Trends 
Genet., 9:138-141, 1993). Each of these genetic changes appears to be responsible for 
imparting some of the traits that, in aggregate, represent the full neoplastic phenotype 
(Hanahan, D. and Weinberg, R. A., Cell, 100:57-70, 2000). 

[006] Colon cancer is the second leading cause of cancer-related deaths in the 
United States. The American Cancer Society estimates that there will be 
approximately 94,700 new cases of colon cancer in the United States in 1999, and that 
colon cancer will be responsible for about 47,900 deaths. Colon cancer frequently 
metastasizes to the liver and the lung. 

[007] Unlike lung cancer, in which smoking has been identified as the prime 
etiologic factor responsible for the disease, the principal mechanisms underlying 
colon cancer are complex and incompletely understood. Dietary factors, especially a 
high fat intake, are believed to promote carcinogenesis. At the molecular level, a 
multistep process involving a number of mutations is suspected in the progression of 
adenomas to colon tumors (Vogelstein et al. (1988) N. Engl. J. Med. 319:525-532). 
The development and progression of colon cancer is driven by sequential mutations in 
three gene types: oncogenes, tumor suppressor genes and mismatch repair genes, 
which control the rate of mutations of other genes, including oncogenes and tumor 
suppressor genes. These mutations occur as a result of genetic predisposition 
(germline mutations) or in response to environmental factors (somatic mutations). 
[008] Several mutations that are associated with colon cancer have been identified. 
Germline mutations that have been linked to hereditary, or familial, colon cancer 
include the tumor suppressor gene adenomatous polyposis coli (APC) (Lengauer et al. 
(1991) Science 253:665-669) and the mismatch-repair genes MutL and MutS 
(Modrich (1995) Phil. Trans. R. Soc. Lond. B 347:89-95; Kolodner (1996) Genes 
Dev. 10:1433-1442). Defective APC has been implicated in familial adenomatous 
polyposis (FAP) and MutL and MutS in hereditary nonpolyposis colorectal cancer 
(HNPCC). Somatic mutations identified in association with sporadic colon cancer 
include the oncogenes K-ras and c-myc, and the tumor suppressor genes p53, APC, 
neurofibromatosis type 1 GTPase-activating protein (NF1 GAP), deleted in colon 
cancer (DCC) and mutated in colon cancer (MCC) (Midgley et al. (1999) Lancet 
353:391-399). 

[009] Conventional therapeutic approaches to treat colon cancer include surgical 
resection, radiation and chemotherapy, including adjuvant therapy. Gene therapeutic 
approaches include transfer of cytokine or immune antigen genes, transfer of enzyme-
prodrug systems (see, e.g., Huber et al. (1993) Cancer Res. 53:4619-4626) and 
replacement of tumor suppressor genes (see, e.g., Venook et al. (1998) Proc. ASCO 
17:431a) using viral vectors (Zwacka et al. (1998) Hematol. Oncol. Clin. North Am. 
12:595-615). 
[0010] While several genes associated with colon cancer have been identified, 
identification of additional genes linked to development (or inhibition of 
development) of colon cancer can provide additional diagnostic tools and therapeutic 
targets. Identification of genes differentially expressed in colon cancer is particularly 
important in the advancement of drug discovery, diagnostic technologies, and the 
understanding of the progression and nature of colon cancer. The invention provides 
for identification of such differentially expressed genes. 

Microarray Background 

[0011] DNA-based arrays can provide an efficient, high-throughput method to 
examine gene expression and genetic variability. For example, SNPs, or single 
nucleotide polymorphisms, are the most common type of human genetic variation. 
DNA-based arrays can dramatically accelerate the discovery of SNPs in hundreds and 
even thousands of genes. Likewise, such arrays can be used for SNP genotyping in 
which DNA samples from individuals or populations are assayed for the presence of 
selected SNPs. These approaches will ultimately lead to the systematic identification 
of all genetic variations in the human genome and the correlation of certain genetic 
variations with disease susceptibility, responsiveness to drug treatments, and other 
medically relevant information. (See, for example, Wang, D. G. et al. (1998) Science 
280:1077-1082.) 

[0012] DNA-based array technology is especially important for the rapid analysis of 
global gene expression patterns. For example, genetic predisposition, disease, or 
therapeutic treatment may directly or indirectly affect the expression of a large 
number of genes in a given tissue. In this case, it is useful to develop a profile, or 
transcript image, of all the genes that are expressed and the levels at which they are 
expressed in that particular tissue. A profile generated from an individual or 
population affected with a certain disease or undergoing a particular therapy may be 
compared with a profile likewise generated from a control individual or population. 
Such analysis does not require knowledge of gene function, as the expression profiles 
can be subjected to mathematical analyses which simply treat each gene as a marker. 
Furthermore, gene expression profiles may help dissect biological pathways by 
identifying all the genes expressed, for example, at a certain developmental stage, in a 
particular tissue, or in response to disease or treatment. (See, for example, Lander, E. 
S. et al. (1996) Science 274:536-539.) 
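
The profile comparison described above can be illustrated with a minimal sketch, assuming each profile is simply a mapping from a gene identifier to a non-negative expression level. The function names, the pseudocount, and the two-fold threshold are hypothetical choices made for illustration only and are not taken from this application.

```python
import math


def log2_fold_changes(test_profile, control_profile, pseudocount=1.0):
    """Return the log2 ratio (test / control) for every gene present in both profiles.

    The pseudocount is an illustrative assumption that avoids division by zero
    for genes with no detectable signal.
    """
    shared_genes = test_profile.keys() & control_profile.keys()
    return {
        gene: math.log2(
            (test_profile[gene] + pseudocount) / (control_profile[gene] + pseudocount)
        )
        for gene in shared_genes
    }


def differential_markers(test_profile, control_profile, threshold=1.0):
    """Keep genes whose absolute log2 fold change meets the (assumed) threshold."""
    changes = log2_fold_changes(test_profile, control_profile)
    return {gene: change for gene, change in changes.items() if abs(change) >= threshold}


# Hypothetical expression levels; each gene is treated only as a marker.
affected = {"geneA": 7.0, "geneB": 1.0, "geneC": 3.0}
control = {"geneA": 1.0, "geneB": 1.0, "geneC": 15.0}
markers = differential_markers(affected, control)
```

No knowledge of gene function enters the calculation, which is consistent with treating each gene purely as a marker; a real analysis would typically add normalization and statistical testing across replicates.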

SUMMARY OF THE INVENTION 

[0013] In one aspect, the invention involves a method of assessing the efficacy of an 
oncological disorder treatment in a subject, wherein the method involves the steps of 
providing a test cell population capable of expressing one or more of the nucleic acid 
sequences of the present invention; detecting the expression of one or more of these 
nucleic acid sequences; comparing the expression to that of the nucleic acid sequences 
in a reference cell population whose cancerous stage is known; and identifying a 
difference in expression level, if present, between the test cell population and the 
reference cell population. In various embodiments, the subject can be a mammal, or, 
more preferably, a human. In other embodiments, the test cell population can be 
provided in vitro, ex vivo from a mammalian subject, or in vivo in a mammalian 
subject. The expression of the nucleic acid sequences may be either increased or 
decreased in the test cell population as compared to the reference cell population. 
[0014] In a further aspect, the invention involves a method of diagnosing an 
oncological disorder, wherein the method involves the steps of providing a test cell 
population capable of expressing one or more of the nucleic acid sequences of the 
present invention; detecting the expression of one or more of these nucleic acid 
sequences; comparing the expression to that of the nucleic acid sequences in a 
reference cell population whose cancerous stage is known; and identifying a 
difference in expression level, if present, between the test cell population and the 
reference cell population. In various embodiments, the subject can be a mammal, or, 
more preferably, a human. In other embodiments, the test cell population can be 
provided in vitro, ex vivo from a mammalian subject, or in vivo in a mammalian 
subject. The expression of the nucleic acid sequences may be either increased or 
decreased in the test cell population as compared to the reference cell population. 
[0015] In another aspect, the invention involves a method of identifying a test 
therapeutic agent for treating an oncological disorder in a subject involving the steps 
of providing a test cell population capable of expressing one or more of the nucleic 
acid sequences of the present invention; contacting the test cell population with the 
test therapeutic agent; detecting the expression of one or more of these nucleic acid 
sequences; comparing the expression to that of the nucleic acid sequences in a 
reference cell population whose cancerous stage is known; and identifying a 
difference in expression level, if present, between the test cell population and the 
reference cell population. In different embodiments, the subject may be a mammal or, 
more preferably, a human. Additionally, the test therapeutic agent may be either a 
known oncological disorder agent or an unknown oncological disorder agent. The 
antagonist may be an antibody having selectivity to at least one of the polypeptides of 
the present invention. The oncological disorder to be treated can be selected from the 
following diseases or disorders: locally advanced tumors, human soft tissue sarcomas, 
metastatic cancer, including lymphatic metastases, blood cell malignancies including 
multiple myeloma, acute and chronic leukemias, and lymphomas, head and neck 
cancers including mouth cancer, larynx cancer and thyroid cancer, lung cancers 
including small cell carcinoma and non-small cell cancers, breast cancers including 
small cell carcinoma and ductal carcinoma, gastrointestinal cancers including 
esophageal cancer, stomach cancer, colon cancer, colorectal cancer and polyps 
associated with colorectal neoplasia, pancreatic cancers, liver cancer, urologic cancers 
including bladder cancer and prostate cancer, malignancies of the female genital tract 
including ovarian carcinoma, uterine (including endometrial) cancers, and solid tumor 
in the ovarian follicle, kidney cancers including renal cell carcinoma, brain cancers 
including intrinsic brain tumors, neuroblastoma, astrocytic brain tumors, gliomas, 
metastatic tumor cell invasion in the central nervous system, bone cancers including 
osteomas, skin cancers including malignant melanoma, tumor progression of human 
skin keratinocytes, squamous cell carcinoma, basal cell carcinoma, 
hemangiopericytoma and Kaposi's sarcoma. 

[0016] In a further aspect, the invention involves a method of identifying or 
determining the susceptibility to, predisposition to, or presence of, an oncological 
disorder in a subject. In this aspect, the method involves the steps of providing a test 
cell population capable of expressing one or more of the nucleic acid sequences of the 
present invention; detecting the expression of one or more of these nucleic acid 
sequences; comparing the expression to that of the nucleic acid sequences in a 
reference cell population whose cancerous stage is known; and identifying a 
difference in expression level, if present, between the test cell population and the 
reference cell population. The subject may be a mammal, or, more preferably, a 
human. 

[0017] In an alternative aspect, the invention involves a method of treating an 
oncological disorder by administering an agent that modulates the expression or 
activity of one or more of the nucleic acid sequences of the present invention to a 
patient suffering from or at risk for developing the oncological disorder. This agent 
can be one that decreases the expression of one or more of the sequences of the present 
invention that are upregulated in cancerous tissues. Alternatively, it can be one that 
increases the expression of one or more of the sequences of the present invention that are 
downregulated. Additionally, the agent can be an antibody to a polypeptide encoded 
by the nucleic acid sequence, an antisense nucleic acid molecule, a peptide, a 
polypeptide agonist, a polypeptide antagonist, a peptidomimetic, a small molecule, or 
another drug. 

[0018] The invention also includes a kit containing one or more reagents for 
detecting two or more of the nucleic acid sequences of the present invention. 
Additionally, the invention involves an array of probe nucleic acids capable of 
detecting two or more of the nucleic acids of the present invention. 
[0019] The polypeptides and nucleic acids of the invention can be used to treat an 
oncological disorder in a subject. Treatment of an oncological disorder may be in a 
mammal, preferably a human. In various embodiments, therapeutic compositions 
containing the polypeptides and nucleic acids of the invention can be used to treat 
oncological disorders. These therapeutic compositions can include a pharmaceutically 
acceptable carrier and, additionally, an active ingredient such as an anti-oncological 
agent or an anti-inflammatory agent. Also provided is a kit containing a therapeutic 
composition for use in the treatment of an oncological disorder along with a 
pharmaceutically acceptable carrier, wherein the therapeutic composition is a 
polypeptide of the present invention, an agonist of a polypeptide of the present 
invention, or an antagonist of a polypeptide of the present invention. 
[0020] Also included in the invention is an isolated nucleic acid molecule that is at 
least 80% identical to the nucleic acid encoding the polypeptide of the present 
invention or the complement of the nucleic acid sequence, as well as vectors and host 
cells containing this nucleic acid sequence. Also provided is a method for producing a 
polypeptide by culturing a host cell transformed with one or more vectors described 
herein under conditions suitable for the expression of the protein encoded by the 
vector. 

[0021] In another aspect, there is provided an isolated polypeptide encoded by an 
isolated nucleic acid sequence or oligonucleotide described herein. In some aspects, 
there is provided the isolated protein, or functional variants or fragments thereof. In another embodiment, 
a variant or fragment of a protein of the present invention retains the respective 
activity. 

[0022] In still further aspects, the invention involves pharmaceutical compositions 
containing either the isolated nucleic acid, isolated polypeptide or antibody. Another 
aspect involves methods of detecting the presence of the nucleic acid and polypeptide. 
[0023] An additional aspect of the present invention is markers and methods of 
prognosticating or detecting oncological disorders based on the presence of nucleic 
acid or polypeptide of the present invention in a biological sample. 
[0024] A further embodiment of the present invention is markers and methods for 
assessing the efficacy of anti-cancer treatments based on monitoring the level of a 
nucleic acid or polypeptide of the present invention in a biological sample. 
[0025] Other features and advantages of the invention will be apparent from the 
following detailed description and from the claims. 

BRIEF DESCRIPTION OF THE DRAWINGS 

[0026] Figures 1-31. Analysis of putative signal sequence and transmembrane 
regions of potentially over-expressed tumor proteins. The first seventy N-terminal 
amino acids of each protein were analyzed with the Hidden Markov Model (HMM) SignalP 
program for the presence of a signal sequence (left graph). In addition, each entire 
protein was examined by the TMHMM program for possible transmembrane regions 
(right graph). From this information, the subcellular location and membrane topology 
of each protein were predicted. Figures 1-31 show the SignalP and TMHMM analyses 
of the protein sequences of PCTUC-5 (SEQ ID NO:39), PCTUC-93 (SEQ ID NO:46), 
PCTUC-190 (SEQ ID NO:57), PCTUC-239 (SEQ ID NO:65), PCTUC-246 (SEQ ID 
NO:40), PCTUC-360 (SEQ ID NO:53), PCTUC-462 (SEQ ID NO:58), PCTUC-468 
(SEQ ID NO:48), PCTUC-536 (SEQ ID NO:3114), PCTUC-582 (SEQ ID NO:64), 
PCTUC-605 (SEQ ID NO:71), PCTUC-629 (SEQ ID NO:67), PCTUC-722 (SEQ ID 
NO:61), PCTUC-748 (SEQ ID NO:63), PCTUC-784 (SEQ ID NO:72), PCTUC-812 
(SEQ ID NO:66), PCTUC-856 (SEQ ID NO:49), PCTUC-898 (SEQ ID NO:43), 
PCTUC-935 (SEQ ID NO:70), PCTUC-936 (SEQ ID NO:42), PCTUC-986 (SEQ ID 
NO:47), PCTUC-991 (SEQ ID NO:75), PCTUC-992 (SEQ ID NO:60), PCTUC-1054 
(SEQ ID NO:59), PCTUC-1061 (SEQ ID NO:55), PCTUC-1073 (SEQ ID NO:56), 
PCTUC-1075 (SEQ ID NO:73), PCTUC-1078 (SEQ ID NO:68), PCTUC-1082 (SEQ 
ID NO:54), PCTUC-1122 (SEQ ID NO:62), and PCTUC-250 (SEQ ID NO:41), 
respectively. 
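
As an illustration of the input preparation described for the signal-sequence analysis, the following minimal sketch extracts the first seventy N-terminal residues of a protein and writes them as a FASTA record. The helper names and the example sequence are hypothetical; the sketch does not call or reproduce the SignalP or TMHMM programs, which would be run separately on the resulting records.

```python
def n_terminal_window(sequence, length=70):
    """Return the first `length` residues after removing whitespace and upper-casing."""
    cleaned = "".join(sequence.split()).upper()
    return cleaned[:length]


def to_fasta(name, sequence, width=60):
    """Format a sequence as a FASTA record with lines of at most `width` residues."""
    lines = [f">{name}"]
    for start in range(0, len(sequence), width):
        lines.append(sequence[start:start + width])
    return "\n".join(lines)


# Hypothetical 100-residue protein used only to exercise the helpers.
example_protein = "M" + "A" * 99
window = n_terminal_window(example_protein)
record = to_fasta("example_protein_Nterm70", window)
```

Proteins shorter than seventy residues are returned whole, so the same helper can be applied to every sequence without special cases.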

DETAILED DESCRIPTION OF THE INVENTION 

[0027] The invention relates to genes associated with a cancerous state. It has been 
discovered that the level of expression of individual genes, also referred to as 
markers, and combinations of these genes, correlates with the presence of cancer in a 
patient. Methods are provided for detecting the presence of cancer in a sample, the 
absence of cancer in a sample, the stage of a cancer, and other characteristics of 
cancer that are relevant to prevention, diagnosis, characterization, and therapy of 
cancer in a patient. The invention also relates to small molecule or antibody 
therapeutics for the treatment of carcinomas. 

[0028] The present invention is based, in part, on identification of novel markers, 
which are over-expressed in colon cancer cells as compared to their expression in 
normal (i.e., non-cancerous) colon cells. The markers of the invention correspond to 
DNA, RNA, and polypeptide molecules, which can be detected in one or both of 
normal and cancerous colon cells. The enhanced expression of one or more of these 
markers in colon cells is herein correlated with the cancerous state of the tissue. The 
invention thus includes compositions, kits, and methods for assessing the cancerous 
state of colon cells (e.g. cells obtained from a human, cultured human cells, archived 
or preserved human cells and in vivo cells). 
[0029] The present invention relates to: 

[0030] An isolated polypeptide comprising an amino acid sequence selected from 
the group consisting of: 
(a) a mature form of an amino acid sequence selected from the group 
consisting of SEQ ID NO:39; SEQ ID NO:40; SEQ ID NO:41; SEQ ID NO:42; SEQ 
ID NO:43; SEQ ID NO:46; SEQ ID NO:47; SEQ ID NO:48; SEQ ID NO:49; SEQ ID 
NO:53; SEQ ID NO:54; SEQ ID NO:55; SEQ ID NO:56; SEQ ID NO:57; SEQ ID 
NO:58; SEQ ID NO:59; SEQ ID NO:60; SEQ ID NO:61; SEQ ID NO:62; SEQ ID 
NO:63; SEQ ID NO:64; SEQ ID NO:65; SEQ ID NO:66; SEQ ID NO:67; SEQ ID 
NO:68; SEQ ID NO:70; SEQ ID NO:71; SEQ ID NO:72; SEQ ID NO:73; SEQ ID 
NO:74; SEQ ID NO:75; and SEQ ID NO:3114; 

(b) a variant of a mature form of an amino acid sequence selected from 
the group consisting of SEQ ID NO:39; SEQ ID NO:40; SEQ ID NO:41; SEQ 
ID NO:42; SEQ ID NO:43; SEQ ID NO:46; SEQ ID NO:47; SEQ ID NO:48; 
SEQ ID NO:49; SEQ ID NO:53; SEQ ID NO:54; SEQ ID NO:55; SEQ ID 
NO:56; SEQ ID NO:57; SEQ ID NO:58; SEQ ID NO:59; SEQ ID NO:60; 
SEQ ID NO:61; SEQ ID NO:62; SEQ ID NO:63; SEQ ID NO:64; SEQ ID 
NO:65; SEQ ID NO:66; SEQ ID NO:67; SEQ ID NO:68; SEQ ID NO:70; 
SEQ ID NO:71; SEQ ID NO:72; SEQ ID NO:73; SEQ ID NO:74; SEQ ID 
NO:75; and SEQ ID NO:3114, wherein one or more amino acid residues in 
said variant differs from the amino acid sequence of said mature form, 
provided that said variant differs in no more than 20% of the amino acid 
residues from the amino acid sequence of said mature form; 

(c) an amino acid sequence comprising a sequence selected from the 
group consisting of SEQ ID NO:39; SEQ ID NO:40; SEQ ID NO:41; SEQ ID 
NO:42; SEQ ID NO:43; SEQ ID NO:46; SEQ ID NO:47; SEQ ID NO:48; 
SEQ ID NO:49; SEQ ID NO:53; SEQ ID NO:54; SEQ ID NO:55; SEQ ID 
NO:56; SEQ ID NO:57; SEQ ID NO:58; SEQ ID NO:59; SEQ ID NO:60; 
SEQ ID NO:61; SEQ ID NO:62; SEQ ID NO:63; SEQ ID NO:64; SEQ ID 
NO:65; SEQ ID NO:66; SEQ ID NO:67; SEQ ID NO:68; SEQ ID NO:70; 
SEQ ID NO:71; SEQ ID NO:72; SEQ ID NO:73; SEQ ID NO:74; SEQ ID 
NO:75; and SEQ ID NO:3114; and 

(d) a variant of an amino acid sequence comprising a sequence 
selected from the group consisting of SEQ ID NO:39; SEQ ID NO:40; SEQ 
ID NO:41; SEQ ID NO:42; SEQ ID NO:43; SEQ ID NO:46; SEQ ID NO:47; 
SEQ ID NO:48; SEQ ID NO:49; SEQ ID NO:53; SEQ ID NO:54; SEQ ID 
NO:55; SEQ ID NO:56; SEQ ID NO:57; SEQ ID NO:58; SEQ ID NO:59; 
SEQ ID NO:60; SEQ ID NO:61; SEQ ID NO:62; SEQ ID NO:63; SEQ ID 
NO:64; SEQ ID NO:65; SEQ ID NO:66; SEQ ID NO:67; SEQ ID NO:68; 
SEQ ID NO:70; SEQ ID NO:71; SEQ ID NO:72; SEQ ID NO:73; SEQ ID 
NO:74; SEQ ID NO:75; and SEQ ID NO:3114, wherein one or more amino 
acid residues in said variant differs from the amino acid sequence, provided 
that said variant differs in no more than 20% of amino acid residues from said 
amino acid sequence. 
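
The 20% limit recited in items (b) and (d) above can be illustrated with a minimal sketch, assuming a position-by-position comparison of two sequences of equal length (substitutions only). That assumption, the function names, and the example sequences are hypothetical; comparing sequences of different lengths would require an alignment step that is not shown.

```python
def count_differences(reference, variant):
    """Count positions at which two equal-length sequences differ."""
    if len(reference) != len(variant):
        raise ValueError("sequences must have equal length for this comparison")
    return sum(1 for ref, var in zip(reference, variant) if ref != var)


def within_variant_limit(reference, variant, max_percent=20):
    """Return True if the variant differs in no more than max_percent of residues."""
    if not reference:
        raise ValueError("reference sequence must not be empty")
    differences = count_differences(reference, variant)
    # Integer arithmetic avoids floating-point rounding at the boundary.
    return differences * 100 <= max_percent * len(reference)


# Hypothetical ten-residue sequences used only for illustration.
mature_form = "ACDEFGHIKL"
two_changes = "VVDEFGHIKL"    # 2 of 10 residues differ (20%)
three_changes = "VVVEFGHIKL"  # 3 of 10 residues differ (30%)
```

Under these assumptions `two_changes` falls within the limit and `three_changes` does not.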

[0031] An isolated polypeptide wherein the amino acid sequence is 
selected from the group consisting of: SEQ ID NO:39; SEQ ID NO:40; SEQ 
ID NO:41; SEQ ID NO:42; SEQ ID NO:43; SEQ ID NO:46; SEQ ID NO:47; 
SEQ ID NO:48; SEQ ID NO:49; SEQ ID NO:53; SEQ ID NO:54; SEQ ID 
NO:55; SEQ ID NO:56; SEQ ID NO:57; SEQ ID NO:58; SEQ ID NO:59; 
SEQ ID NO:60; SEQ ID NO:61; SEQ ID NO:62; SEQ ID NO:63; SEQ ID 
NO:64; SEQ ID NO:65; SEQ ID NO:66; SEQ ID NO:67; SEQ ID NO:68; 
SEQ ID NO:70; SEQ ID NO:71; SEQ ID NO:72; SEQ ID NO:73; SEQ ID 
NO:74; SEQ ID NO:75; and SEQ ID NO:3114. 

[0032] An isolated polypeptide wherein the amino acid sequence is 
selected from the group consisting of: SEQ ID NO:39; SEQ ID NO:56; SEQ 
ID NO:59; SEQ ID NO:61; SEQ ID NO:62; SEQ ID NO:65; and SEQ ID 
NO:68. 

[0033] An isolated polypeptide wherein the amino acid sequence is 
selected from the group consisting of: SEQ ID NO:43; SEQ ID NO:46; SEQ 
ID NO:47; SEQ ID NO:54; SEQ ID NO:55; SEQ ID NO:57; SEQ ID NO:58; 
SEQ ID NO:60; SEQ ID NO:63; SEQ ID NO:64; SEQ ID NO:66; SEQ ID 
NO:67; SEQ ID NO:70; SEQ ID NO:73; and SEQ ID NO:3114. 

[0034] An isolated polypeptide wherein the amino acid sequence is 
selected from the group consisting of: SEQ ID NO:40; SEQ ID NO:41; SEQ 
ID NO:42; SEQ ID NO:48; SEQ ID NO:49; SEQ ID NO:53; SEQ ID NO:71; 
SEQ ID NO:72; and SEQ ID NO:75. 

[0035] An isolated polypeptide wherein the amino acid sequence is 
selected from the group consisting of: SEQ ID NO:39; SEQ ID NO:40; SEQ 
ID NO:41; SEQ ID NO:42; and SEQ ID NO:43. 

[0036] An isolated polypeptide wherein said amino acid sequence is 
encoded by a nucleic acid sequence selected from the group consisting of SEQ 
ID NO:1; SEQ ID NO:2; SEQ ID NO:3; SEQ ID NO:4; SEQ ID NO:5; SEQ 
ID NO:8; SEQ ID NO:9; SEQ ID NO:10; SEQ ID NO:11; SEQ ID NO:15; 
SEQ ID NO:16; SEQ ID NO:17; SEQ ID NO:18; SEQ ID NO:19; SEQ ID 
NO:20; SEQ ID NO:21; SEQ ID NO:22; SEQ ID NO:23; SEQ ID NO:24; 
SEQ ID NO:25; SEQ ID NO:26; SEQ ID NO:27; SEQ ID NO:28; SEQ ID 
NO:29; SEQ ID NO:30; SEQ ID NO:32; SEQ ID NO:33; SEQ ID NO:34; 
SEQ ID NO:35; SEQ ID NO:37; and SEQ ID NO:3113. 

[0037] A polypeptide wherein said polypeptide comprises an amino acid 
sequence of a naturally-occurring allelic variant of an amino acid sequence 
selected from the group consisting of SEQ ID NO:39; SEQ ID NO:40; SEQ 
ID NO:41; SEQ ID NO:42; SEQ ID NO:43; SEQ ID NO:46; SEQ ID NO:47; 
SEQ ID NO:48; SEQ ID NO:49; SEQ ID NO:53; SEQ ID NO:54; SEQ ID 
NO:55; SEQ ID NO:56; SEQ ID NO:57; SEQ ID NO:58; SEQ ID NO:59; 
SEQ ID NO:60; SEQ ID NO:61; SEQ ID NO:62; SEQ ID NO:63; SEQ ID 
NO:64; SEQ ID NO:65; SEQ ID NO:66; SEQ ID NO:67; SEQ ID NO:68; 
SEQ ID NO:70; SEQ ID NO:71; SEQ ID NO:72; SEQ ID NO:73; SEQ ID 
NO:74; SEQ ID NO:75; and SEQ ID NO:3114. 

[0038] A polypeptide wherein said allelic variant comprises an amino 
acid sequence that is the translation of a nucleic acid sequence differing by a 
single nucleotide from a nucleic acid sequence selected from the group 
consisting of SEQ ID NO:1; SEQ ID NO:2; SEQ ID NO:3; SEQ ID NO:4; 
SEQ ID NO:5; SEQ ID NO:8; SEQ ID NO:9; SEQ ID NO:10; SEQ ID 
NO:11; SEQ ID NO:15; SEQ ID NO:16; SEQ ID NO:17; SEQ ID NO:18; 
SEQ ID NO:19; SEQ ID NO:20; SEQ ID NO:21; SEQ ID NO:22; SEQ ID 
NO:23; SEQ ID NO:24; SEQ ID NO:25; SEQ ID NO:26; SEQ ID NO:27; 
SEQ ID NO:28; SEQ ID NO:29; SEQ ID NO:30; SEQ ID NO:32; SEQ ID 
NO:33; SEQ ID NO:34; SEQ ID NO:35; SEQ ID NO:37; and SEQ ID 
NO:3113. 

[0039] A polypeptide variant wherein the amino acid substitution is a 
conservative amino acid substitution. 

[0040] A nucleic acid molecule selected from the group consisting of SEQ 
ID NO:1; SEQ ID NO:2; SEQ ID NO:3; SEQ ID NO:4; SEQ ID NO:5; SEQ 
ID NO:8; SEQ ID NO:9; SEQ ID NO:10; SEQ ID NO:11; SEQ ID NO:15; 
SEQ ID NO:16; SEQ ID NO:17; SEQ ID NO:18; SEQ ID NO:19; SEQ ID 
NO:20; SEQ ID NO:21; SEQ ID NO:22; SEQ ID NO:23; SEQ ID NO:24; 
SEQ ID NO:25; SEQ ID NO:26; SEQ ID NO:27; SEQ ID NO:28; SEQ ID 
NO:29; SEQ ID NO:30; SEQ ID NO:32; SEQ ID NO:33; SEQ ID NO:34; 
SEQ ID NO:35; SEQ ID NO:37; and SEQ ID NO:3113. 

[0041] A nucleic acid molecule selected from the group consisting of SEQ 
ID NO:1; SEQ ID NO:18; SEQ ID NO:21; SEQ ID NO:23; SEQ ID NO:24; 
SEQ ID NO:27; and SEQ ID NO:30. 

[0042] A nucleic acid molecule selected from the group consisting of SEQ 
ID NO:5; SEQ ID NO:8; SEQ ID NO:9; SEQ ID NO:16; SEQ ID NO:17; 
SEQ ID NO:19; SEQ ID NO:20; SEQ ID NO:22; SEQ ID NO:25; SEQ ID 
NO:26; SEQ ID NO:28; SEQ ID NO:29; SEQ ID NO:32; SEQ ID NO:35; and 
SEQ ID NO:3113. 

[0043] A nucleic acid molecule selected from the group consisting of SEQ 
ID NO:2; SEQ ID NO:3; SEQ ID NO:4; SEQ ID NO:10; SEQ ID NO:11; 
SEQ ID NO:15; SEQ ID NO:33; SEQ ID NO:34; and SEQ ID NO:37. 

[0044] A nucleic acid molecule selected from the group consisting of SEQ 
ID NO:1; SEQ ID NO:2; SEQ ID NO:3; SEQ ID NO:4; and SEQ ID NO:5. 

[0045] A nucleic acid molecule that differs by a single nucleotide from a 
nucleic acid sequence selected from the group consisting of: SEQ ID NO:1; 
SEQ ID NO:2; SEQ ID NO:3; SEQ ID NO:4; SEQ ID NO:5; SEQ ID NO:8; 
SEQ ID NO:9; SEQ ID NO:10; SEQ ID NO:11; SEQ ID NO:15; SEQ ID 
NO:16; SEQ ID NO:17; SEQ ID NO:18; SEQ ID NO:19; SEQ ID NO:20; 
SEQ ID NO:21; SEQ ID NO:22; SEQ ID NO:23; SEQ ID NO:24; SEQ ID 
NO:25; SEQ ID NO:26; SEQ ID NO:27; SEQ ID NO:28; SEQ ID NO:29; 
SEQ ID NO:30; SEQ ID NO:32; SEQ ID NO:33; SEQ ID NO:34; SEQ ID 
NO:35; SEQ ID NO:37; and SEQ ID NO:3113. 
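
A single-nucleotide difference of the kind recited above can be checked with a minimal sketch, assuming the difference is a substitution between two sequences of equal length. Single-nucleotide insertions or deletions are deliberately not handled, and the function name and example sequences are hypothetical.

```python
def differs_by_single_nucleotide(reference, candidate):
    """Return True if the sequences have equal length and exactly one mismatch.

    Comparison is case-insensitive; insertions and deletions are out of scope
    for this sketch and yield False because the lengths differ.
    """
    if len(reference) != len(candidate):
        return False
    mismatches = sum(
        1 for a, b in zip(reference.upper(), candidate.upper()) if a != b
    )
    return mismatches == 1


# Hypothetical six-nucleotide sequences used only for illustration.
reference_seq = "ATGCCA"
one_substitution = "ATGACA"
two_substitutions = "TTGACA"
```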

[0046] A nucleic acid molecule that hybridizes under stringent conditions 
to a nucleotide sequence selected from the group consisting of: SEQ ID NO:1; 
SEQ ID NO:2; SEQ ID NO:3; SEQ ID NO:4; SEQ ID NO:5; SEQ ID NO:8; 
SEQ ID NO:9; SEQ ID NO:10; SEQ ID NO:11; SEQ ID NO:15; SEQ ID 
NO:16; SEQ ID NO:17; SEQ ID NO:18; SEQ ID NO:19; SEQ ID NO:20; 
SEQ ID NO:21; SEQ ID NO:22; SEQ ID NO:23; SEQ ID NO:24; SEQ ID 
NO:25; SEQ ID NO:26; SEQ ID NO:27; SEQ ID NO:28; SEQ ID NO:29; 
SEQ ID NO:30; SEQ ID NO:32; SEQ ID NO:33; SEQ ID NO:34; SEQ ID 
NO:35; SEQ ID NO:37; and SEQ ID NO:3113, or a complement of said 
nucleotide sequence. 
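
For illustration, the complement of a nucleotide sequence can be computed as in the following minimal sketch, which returns the reverse complement, that is, the complementary strand read 5' to 3'. It assumes an unambiguous DNA alphabet (A, C, G, T); the function name and example sequence are hypothetical, and the sketch does not model hybridization stringency.

```python
# Translation table mapping each base to its Watson-Crick partner.
_COMPLEMENT_TABLE = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(sequence):
    """Return the complementary strand of a DNA sequence, read 5' to 3'."""
    return sequence.translate(_COMPLEMENT_TABLE)[::-1]


# Hypothetical four-nucleotide sequence used only for illustration.
example_strand = "ATGC"
complement_strand = reverse_complement(example_strand)
```

Applying the function twice returns the original sequence, which gives a quick sanity check on any input.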

[0047] A vector comprising a nucleic acid molecule of the present 
invention. 

[0048] A vector comprising a nucleic acid molecule of the present 
invention operably linked to a promoter. 

[0049] A cell transformed or transfected with a vector of the 
present invention. 
[0050] A cell transformed or transfected with a nucleic acid sequence 
selected from the group consisting of SEQ ID NO:1; SEQ ID NO:2; SEQ ID NO:3; SEQ ID 
NO:4; SEQ ID NO:5; SEQ ID NO:8; SEQ ID NO:9; SEQ ID NO:10; SEQ ID 
NO:11; SEQ ID NO:15; SEQ ID NO:16; SEQ ID NO:17; SEQ ID NO:18; 
SEQ ID NO:19; SEQ ID NO:20; SEQ ID NO:21; SEQ ID NO:22; SEQ ID 
NO:23; SEQ ID NO:24; SEQ ID NO:25; SEQ ID NO:26; SEQ ID NO:27; 
SEQ ID NO:28; SEQ ID NO:29; SEQ ID NO:30; SEQ ID NO:32; SEQ ID 
NO:33; SEQ ID NO:34; SEQ ID NO:35; SEQ ID NO:37; and SEQ ID 
NO:3113. 

[0051] A microarray comprising nucleic acid sequences selected from the 
group consisting of SEQ ID NO:77 through SEQ ID NO:3011.

[0052] A microarray comprising nucleic acid sequences selected from the
group consisting of SEQ ID NO:77 through SEQ ID NO:1993.

[0053] A microarray comprising nucleic acid sequences selected from the
group consisting of SEQ ID NO:1994 through SEQ ID NO:3011.

[0054] A microarray comprising a nucleic acid sequence selected from the 
group consisting of SEQ ID NO:1; SEQ ID NO:2; SEQ ID NO:3; SEQ ID
NO:4; SEQ ID NO:5; SEQ ID NO:8; SEQ ID NO:9; SEQ ID NO:10; SEQ ID
NO:11; SEQ ID NO:15; SEQ ID NO:16; SEQ ID NO:17; SEQ ID NO:18;
SEQ ID NO:19; SEQ ID NO:20; SEQ ID NO:21; SEQ ID NO:22; SEQ ID
NO:23; SEQ ID NO:24; SEQ ID NO:25; SEQ ID NO:26; SEQ ID NO:27;
SEQ ID NO:28; SEQ ID NO:29; SEQ ID NO:30; SEQ ID NO:32; SEQ ID
NO:33; SEQ ID NO:34; SEQ ID NO:35; SEQ ID NO:37; and SEQ ID
NO:3113.

[0055] A microarray comprising a nucleic acid sequence selected from the 
group consisting of SEQ ID NO:1; SEQ ID NO:18; SEQ ID NO:21; SEQ ID
NO:23; SEQ ID NO:24; SEQ ID NO:27; and SEQ ID NO:30.

[0056] A microarray comprising a nucleic acid sequence selected from the
group consisting of SEQ ID NO:1; SEQ ID NO:18; SEQ ID NO:21; SEQ ID
NO:23; SEQ ID NO:24; SEQ ID NO:27; and SEQ ID NO:30.

[0057] A microarray comprising a nucleic acid sequence selected from the 
group consisting of SEQ ID NO:5; SEQ ID NO:8; SEQ ID NO:9; SEQ ID
NO:16; SEQ ID NO:17; SEQ ID NO:19; SEQ ID NO:20; SEQ ID NO:22;
SEQ ID NO:25; SEQ ID NO:26; SEQ ID NO:28; SEQ ID NO:29; SEQ ID
NO:32; SEQ ID NO:35; and SEQ ID NO:3113.

[0058] A microarray comprising a nucleic acid sequence selected from the 
group consisting of SEQ ID NO:2; SEQ ID NO:3; SEQ ID NO:4; SEQ ID 
NO:10; SEQ ID NO:11; SEQ ID NO:15; SEQ ID NO:33; SEQ ID NO:34; and
SEQ ID NO:37.

[0059] A microarray comprising a nucleic acid sequence selected from the 
group consisting of SEQ ID NO:1; SEQ ID NO:2; SEQ ID NO:3; SEQ ID
NO:4; and SEQ ID NO:5. 

[0060] An antibody that immunospecifically binds to the polypeptide of
SEQ ID NO:39; SEQ ID NO:40; SEQ ID NO:41; SEQ ID NO:42; SEQ ID
NO:43; SEQ ID NO:46; SEQ ID NO:47; SEQ ID NO:48; SEQ ID NO:49;
SEQ ID NO:53; SEQ ID NO:54; SEQ ID NO:55; SEQ ID NO:56; SEQ ID
NO:57; SEQ ID NO:58; SEQ ID NO:59; SEQ ID NO:60; SEQ ID NO:61;
SEQ ID NO:62; SEQ ID NO:63; SEQ ID NO:64; SEQ ID NO:65; SEQ ID
NO:66; SEQ ID NO:67; SEQ ID NO:68; SEQ ID NO:70; SEQ ID NO:71;
SEQ ID NO:72; SEQ ID NO:73; SEQ ID NO:74; SEQ ID NO:75; and SEQ
ID NO:3114.

[0061] The antibody may be a monoclonal antibody or an antibody fragment
selected from, but not limited to, an Fv fragment, a Fab fragment, a (Fab)2
fragment, or a single chain antibody.

[0062] The antibody may be conjugated with at least one polyethylene
glycol moiety.

[0063] The antibody may be an antagonist. 

[0064] The antibody may be a humanized antibody or a human antibody. 

[0065] A method for determining the presence or amount of the 
polypeptide of the present invention in a sample, the method comprising:

(a) providing the sample; 

(b) contacting the sample with an antibody that binds immunospecifically to 
the polypeptide; and 

(c) determining the presence or amount of antibody bound to said polypeptide; 
thereby determining the presence or amount of polypeptide in said sample. 

[0066] A method for determining the presence or amount of the nucleic 
acid molecule of the present invention in a sample, the method comprising: 

(a) providing the sample; 

(b) contacting the sample with a probe that binds to said nucleic acid 
molecule; and 

(c) determining the presence or amount of the probe bound to said nucleic acid 
molecule; 

thereby determining the presence or amount of the nucleic acid molecule in 
said sample. 

[0067] A method of identifying an agent that binds to a polypeptide of
claim 1, the method comprising: 

(a) contacting said polypeptide with said agent; and 

(b) determining whether said agent binds to said polypeptide. 

[0068] A method for identifying an agent that modulates the expression or 
activity of the polypeptide of claim 1, the method comprising: 

(a) providing a cell expressing said polypeptide in an operational manner;

(b) contacting the cell with said agent; and

(c) determining whether the agent modulates expression or activity of
said polypeptide;

whereby an alteration in expression or activity of said polypeptide indicates
said agent modulates expression or activity of said polypeptide.

[0069] A method for modulating the activity of the polypeptide of the 
present invention, the method comprising contacting a cell sample expressing 
the polypeptide with a compound that binds to the polypeptide in an amount 
sufficient to modulate the activity of the polypeptide. 

[0070] A method of treating or preventing a cancer-associated disorder, 
said method comprising administering to a subject in which such treatment or 
prevention is desired the polypeptide of the present invention in an amount
sufficient to treat or prevent said cancer-associated disorder in said subject. 

[0071] A method of treating or preventing a cancer-associated disorder, 
said method comprising administering to a subject in which such treatment or
prevention is desired an antibody of the present invention in an amount 
sufficient to treat or prevent said cancer-associated disorder in said subject. 

[0072] A pharmaceutical composition comprising a polypeptide of the 
present invention and at least one pharmaceutically acceptable carrier.

[0073] A kit comprising the pharmaceutical composition of the present 
invention. 

[0074] A method of detecting differentially expressed genes correlated 
with a cancerous state of a mammalian cell, the method comprising the step of 
detecting at least one differentially expressed gene product in a test sample 
derived from a cell suspected of being cancerous, where the gene product is 
encoded by a sequence of SEQ ID NO:1; SEQ ID NO:2; SEQ ID NO:3; SEQ
ID NO:4; SEQ ID NO:5; SEQ ID NO:8; SEQ ID NO:9; SEQ ID NO:10; SEQ
ID NO:11; SEQ ID NO:15; SEQ ID NO:16; SEQ ID NO:17; SEQ ID NO:18;
SEQ ID NO:19; SEQ ID NO:20; SEQ ID NO:21; SEQ ID NO:22; SEQ ID
NO:23; SEQ ID NO:24; SEQ ID NO:25; SEQ ID NO:26; SEQ ID NO:27;
SEQ ID NO:28; SEQ ID NO:29; SEQ ID NO:30; SEQ ID NO:32; SEQ ID
NO:33; SEQ ID NO:34; SEQ ID NO:35; SEQ ID NO:37; and SEQ ID
NO:3113, wherein detection of the differentially expressed product is correlated
with a cancerous state of the cell from which the test sample was derived.

[0075] A method for detecting the presence of a nucleic acid molecule of 
the present invention in a sample comprising: 

(a) contacting the sample with a nucleic acid probe or primer which 
selectively hybridizes to the nucleic acid molecule; and 

(b) determining whether the nucleic acid probe or primer binds to a nucleic 
acid molecule in the sample to thereby detect the presence of a nucleic acid 
molecule in the sample. 

[0076] A method for detecting the presence of a nucleic acid molecule of 
the present invention in a sample comprising: 

(a) contacting the sample with a nucleic acid probe or primer which 
selectively hybridizes to the nucleic acid molecule; and

(b) determining whether the nucleic acid probe or primer binds to a nucleic acid 
molecule in the sample to thereby detect the presence of a nucleic acid molecule in 
the sample. 

[0077] A method for detecting the presence of a nucleic acid molecule 
selected from the group consisting of SEQ ID NO:77 through SEQ ID
NO:3011 in a sample comprising:

(a) contacting the sample with a nucleic acid probe or primer which
selectively hybridizes to the nucleic acid molecule; and 

(b) determining whether the nucleic acid probe or primer binds to a nucleic 
acid molecule in the sample to thereby detect the presence of a nucleic acid 
molecule selected from the group consisting of SEQ ID NO:77 through SEQ
ID NO:3011 in the sample.

[0078] A method for detecting the presence of a nucleic acid molecule 
selected from the group consisting of SEQ ID NO:77 through SEQ ID 
NO:1993 in a sample comprising:

(a) contacting the sample with a nucleic acid probe or primer which 
selectively hybridizes to the nucleic acid molecule; and 

(b) determining whether the nucleic acid probe or primer binds to a nucleic 
acid molecule in the sample to thereby detect the presence of a nucleic acid 
molecule selected from the group consisting of SEQ ID NO:77 through SEQ 
ID NO:1993 in the sample.

[0079] A method for detecting the presence of a nucleic acid molecule 
selected from the group consisting of SEQ ID NO:1994 through SEQ ID
NO:3011 in a sample comprising:

(a) contacting the sample with a nucleic acid probe or primer which 
selectively hybridizes to the nucleic acid molecule; and 

(b) determining whether the nucleic acid probe or primer binds to a nucleic 
acid molecule in the sample to thereby detect the presence of a nucleic acid 
molecule selected from the group consisting of SEQ ID NO:1994 through
SEQ ID NO:3011 in the sample.

[0080] A method for monitoring the progression of a cancer in a patient, 
the method comprising: 

(a) detecting in a patient sample at a first point in time, the expression of a 
marker, wherein the marker is a nucleic acid molecule of the present invention;

(b) repeating step a) at a subsequent point in time; and 

(c) comparing the level of expression detected in steps a) and b), and 
therefrom monitoring the progression of the cancer. 

[0081] A method for monitoring the progression of a cancer in a patient,
the method comprising:

(a) detecting in a patient sample at a first point in time, the expression of a
marker, wherein the marker is a nucleic acid molecule of the present
invention;

(b) repeating step a) at a subsequent point in time; and 

(c) comparing the level of expression detected in steps a) and b), and 
therefrom monitoring the progression of the cancer. 

[0082] A method for monitoring the progression of a cancer in a patient, 
the method comprising: 

(a) detecting in a patient sample at a first point in time, the expression of a 
marker, wherein the marker is a nucleic acid molecule selected from the group 
consisting of SEQ ID NO:77 through SEQ ID NO:3011;

(b) repeating step a) at a subsequent point in time; and

(c) comparing the level of expression detected in steps a) and b), and 
therefrom monitoring the progression of the cancer. 

[0083] A method for monitoring the progression of a cancer in a patient, 
the method comprising: 

(a) detecting in a patient sample at a first point in time, the expression of a
marker, wherein the marker is a nucleic acid molecule selected from the group
consisting of SEQ ID NO:77 through SEQ ID NO:1993;

(b) repeating step a) at a subsequent point in time; and 

(c) comparing the level of expression detected in steps a) and b), and 
therefrom monitoring the progression of the cancer. 

[0084] A method for monitoring the progression of a cancer in a patient, 
the method comprising: 

(a) detecting in a patient sample at a first point in time, the expression of a 
marker, wherein the marker is a nucleic acid molecule selected from the group 
consisting of SEQ ID NO:1994 through SEQ ID NO:3011;

(b) repeating step a) at a subsequent point in time; and 

(c) comparing the level of expression detected in steps a) and b), and
therefrom monitoring the progression of the cancer. 

[0085] A method of assessing the efficacy of a test compound for 
inhibiting a cancer in a patient, the method comprising comparing: 

(a) expression of a marker in a first sample obtained from the patient exposed 
to the test compound, wherein the marker is the nucleic acid molecule
of the present invention, and 

(b) expression of the marker in a second sample obtained from the patient, 
wherein the sample is not exposed to the test compound, wherein a 
significantly lower level of expression of the marker in the first sample, 
relative to the second sample, is an indication that the test compound is 
efficacious for inhibiting the cancer in the patient. 

[0086] A method of assessing the efficacy of a test compound for 
inhibiting a cancer in a patient, the method comprising comparing: 

(a) expression of a marker in a first sample obtained from the patient exposed
to the test compound, wherein the marker is a nucleic acid
molecule selected from the group consisting of SEQ ID NO:77 through SEQ
ID NO:1993, and

(b) expression of the marker in a second sample obtained from the patient, 
wherein the sample is not exposed to the test compound, wherein a 
significantly lower level of expression of the marker in the first sample,
relative to the second sample, is an indication that the test compound is
efficacious for inhibiting the cancer in the patient.

[0087] A method of assessing the efficacy of a therapy for inhibiting a 
cancer in a patient, the method comprising comparing: 

(a) expression of a marker in a first sample obtained from the patient prior
to providing at least a portion of the therapy to the patient, wherein the marker
is a nucleic acid sequence selected from the group consisting of SEQ ID
NO:1994 through SEQ ID NO:3011, and

(b) expression of the marker in a second sample obtained from the patient
following provision of the portion of the therapy, wherein a significantly lower 
level of expression of the marker in the second sample, relative to the first 
sample, is an indication that the therapy is efficacious for inhibiting the cancer 
in the patient. 

[0088] A method of selecting a composition for inhibiting a cancer in a 
patient, the method comprising: 

(a) obtaining a sample comprising cancer cells from the patient; 

(b) separately exposing aliquots of the sample in the presence of a plurality of 
test compositions; 

(c) comparing expression of a marker in each of the aliquots, wherein the 
marker is the nucleic acid of the present invention; and

(d) selecting one of the test compositions which alters the level of expression 
of the marker in the aliquot containing that test composition, relative to other 
test compositions. 
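
Steps (c) and (d) of the selection method can be sketched in Python as follows. This is only an illustrative sketch, not part of the disclosed method: the composition names and expression values are hypothetical, and the sketch assumes that, for a marker that is up-regulated in tumor cells, the composition of interest is the one whose aliquot shows the lowest marker expression relative to the other aliquots.

```python
def select_composition(expression_by_composition):
    """Return the test composition whose aliquot shows the lowest marker expression.

    expression_by_composition maps a (hypothetical) composition name to the
    marker expression level measured in the aliquot exposed to it.
    Assumption: the marker is up-regulated in tumor cells, so a lower value
    is taken as the desired alteration.
    """
    if not expression_by_composition:
        raise ValueError("at least one test composition is required")
    return min(expression_by_composition, key=expression_by_composition.get)


# Hypothetical marker expression (arbitrary units) measured in each aliquot.
aliquots = {"composition_A": 8.2, "composition_B": 3.1, "composition_C": 7.9}
selected = select_composition(aliquots)
```

For a marker that is down-regulated in tumor cells, the same comparison would use `max` instead of `min`.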

[0089] A method of selecting a composition for inhibiting a cancer in a 
patient, the method comprising: 

(a) obtaining a sample comprising cancer cells from the patient; 

(b) separately exposing aliquots of the sample in the presence of a plurality of 
test compositions; 

(c) comparing expression of a marker in each of the aliquots, wherein the 
marker is the nucleic acid of the present invention; and 

(d) selecting one of the test compositions which alters the level of expression 
of the marker in the aliquot containing that test composition, relative to other 
test compositions. 

[0090] A method of selecting a composition for inhibiting a cancer in a 
patient, the method comprising: 

(a) obtaining a sample comprising cancer cells from the patient; 

(b) separately exposing aliquots of the sample in the presence of a plurality of 
test compositions; 

(c) comparing expression of a marker in each of the aliquots, wherein the
marker is a nucleic acid sequence selected from the group consisting of SEQ
ID NOs:77-1993; and

(d) selecting one of the test compositions which alters the level of expression 
of the marker in the aliquot containing that test composition, relative to other 
test compositions. 

[0091] A method of selecting a composition for inhibiting a
cancer in a patient, the method comprising:

(a) obtaining a sample comprising cancer cells from the patient; 

(b) separately exposing aliquots of the sample in the presence of a plurality of 
test compositions; 

(c) comparing expression of a marker in each of the aliquots, wherein the 
marker is a nucleic acid sequence selected from the group consisting of SEQ 
ID NOs:1994-3011; and

(d) selecting one of the test compositions which alters the level of expression 
of the marker in the aliquot containing that test composition, relative to other 
test compositions. 

[0092] An antisense compound 8 to 30 nucleobases in length targeted to a nucleic 
acid molecule of the present invention encoding the polypeptide of the present 
invention, wherein said antisense compound specifically hybridizes with and inhibits 
the expression of the polypeptide. 
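
As an illustration of the length constraint and of what "targeted to" means at the sequence level, the following Python sketch derives a candidate antisense sequence as the reverse complement of a chosen target region. It is a sketch under stated assumptions: the target sequence shown is hypothetical and is not taken from the sequence listing, the output is written as DNA, and no claim is made that any sequence derived this way hybridizes specifically or inhibits expression.

```python
# Watson-Crick complements; U is accepted so that mRNA targets can be used.
COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A", "U": "A"}


def antisense_candidate(target, start, length):
    """Return the DNA reverse complement of target[start:start + length].

    The length must be 8 to 30 nucleobases, matching the range stated above.
    """
    if not 8 <= length <= 30:
        raise ValueError("antisense length must be 8 to 30 nucleobases")
    region = target[start:start + length].upper()
    if len(region) != length:
        raise ValueError("target region extends past the end of the sequence")
    return "".join(COMPLEMENT[base] for base in reversed(region))


# Hypothetical target fragment, for illustration only.
candidate = antisense_candidate("ATGGCGTTCA", 0, 8)
```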

[0093] Transcriptional profiling analysis was performed on primary colon tumors.
mRNA was isolated from the colon tumors and from normal colons. Transcripts
that were differentially expressed more than 2-fold in 30% or more of the
tumors were further analyzed by SignalP analysis and Trans
Membrane Hidden Markov Models (TMHMM) to determine whether they were membrane
associated or secreted. The transcripts determined to be associated with the
membrane were then further validated by SYBR Green (real-time PCR) analysis on an
additional set of colon and breast tumors compared to corresponding normal tissue.
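
The first selection criterion above (a change of more than 2-fold in 30% or more of the tumors) can be sketched in Python as follows. This is a minimal sketch, not the analysis actually performed: the transcript names and fold-change values are hypothetical, fold change is assumed to be the tumor-to-normal expression ratio, and the sketch assumes that the 30% threshold is applied separately to increases and decreases. The SignalP, TMHMM, and real-time PCR steps are not represented.

```python
def passes_filter(fold_changes, min_fold=2.0, min_fraction=0.30):
    """Return True if a transcript changes by more than min_fold, in one
    direction, in at least min_fraction of the tumors.

    fold_changes holds one tumor/normal expression ratio per tumor.
    """
    if not fold_changes:
        return False
    n = len(fold_changes)
    up = sum(1 for fc in fold_changes if fc > min_fold)
    down = sum(1 for fc in fold_changes if fc < 1.0 / min_fold)
    return up / n >= min_fraction or down / n >= min_fraction


def select_transcripts(fold_by_transcript, min_fold=2.0, min_fraction=0.30):
    """Return the names of transcripts that pass the filter, in input order."""
    return [
        name
        for name, fold_changes in fold_by_transcript.items()
        if passes_filter(fold_changes, min_fold, min_fraction)
    ]


# Hypothetical tumor/normal ratios for ten tumors per transcript.
example = {
    "T_up": [2.5, 3.0, 2.2, 1.1, 0.9, 1.0, 1.0, 1.0, 1.0, 1.0],
    "T_flat": [1.0, 1.2, 0.8, 1.9, 1.0, 1.0, 1.0, 1.0, 1.0, 2.0],
    "T_down": [0.4, 0.3, 0.2, 0.45, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0],
    "T_mixed": [2.5, 0.3, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0],
}
selected = select_transcripts(example)
```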

[0094] These differentially expressed sequences can be used to develop
new small molecule and antibody therapeutics for the treatment of carcinoma,
including colon, lung, breast, and prostate tumors. All of these sequences also
provide potential biomarkers and efficacy markers for carcinoma. The sequences can
be used to generate antibody reagents and full-length clones, and can be evaluated
in in vitro and in vivo assays to determine function.

[0095] The present invention discloses the nucleic acid sequences SEQ ID
NOs:1994-3011, which are down-regulated in human colon tumor cells.

[0096] The present invention discloses the nucleic acid sequences SEQ ID NOs:1-11,
SEQ ID NOs:13-38, SEQ ID NOs:77-1993, and SEQ ID NO:3113, which are
up-regulated in human colon tumor cells.
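
The SEQ ID NO ranges stated in paragraphs [0095] and [0096] can be encoded as a simple lookup, sketched below in Python. The function name and return values are illustrative choices, not part of the disclosure; only the numeric ranges come from the text.

```python
# Ranges from paragraphs [0095] and [0096]; Python ranges exclude the end value.
UP_REGULATED = [range(1, 12), range(13, 39), range(77, 1994), range(3113, 3114)]
DOWN_REGULATED = [range(1994, 3012)]


def regulation(seq_id_no):
    """Return "up", "down", or None for a nucleic acid SEQ ID NO.

    None means the number falls outside the ranges stated in the text.
    """
    if any(seq_id_no in r for r in UP_REGULATED):
        return "up"
    if any(seq_id_no in r for r in DOWN_REGULATED):
        return "down"
    return None
```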

[0097] The present invention discloses 31 transcripts, and the proteins encoded by
them, that are up-regulated in human colon tumor cells. A summary of these
differentially expressed genes is included in Table 1.



TABLE 1

Transcript Designation | Gene Name | GenBank acc number | Gene sequence | Protein sequence
-----------------------+-----------+--------------------+---------------+-----------------
PCTUC_5 | (p-cadherin) | NM_001793 | SEQ ID NO:1 | SEQ ID NO:39
PCTUC_93 | (opiate binding-cell adhesion molecule) | NM_002545 | SEQ ID NO:8 | SEQ ID NO:46
PCTUC_190 | (organic anion transporter polypeptide-related protein 1, OATPRP1) | NM_016354 | SEQ ID NO:19 | SEQ ID NO:57
PCTUC_239 | (carcinoembryonic antigen-related cell adhesion molecule 5) (CEACAM5) | NM_004363 | SEQ ID NO:27 | SEQ ID NO:65
PCTUC_246 | (RON kinase) | NM_002447 | SEQ ID NO:2 | SEQ ID NO:40
PCTUC_250 | G-protein coupled receptor (GPR56) | NM_005682 | SEQ ID NO:3 | SEQ ID NO:41
PCTUC_360 | (STEAP1) | NM_012449 | SEQ ID NO:15 | SEQ ID NO:53
PCTUC_462 | (bumetanide-sensitive Na-K-Cl cotransporter, NKCC1) | NM_001046 | SEQ ID NO:20 | SEQ ID NO:58
PCTUC_468 | (neuroligin 1, NLGN1) | NM_014932 | SEQ ID NO:10 | SEQ ID NO:48
PCTUC_536 | (COLNOV1) | NM_033408 | SEQ ID NO:3113 | SEQ ID NO:3114
PCTUC_582 | (CEACAM8) | NM_001816 | SEQ ID NO:26 | SEQ ID NO:64
PCTUC_605 | (epican, CD44) | NM_000610 | SEQ ID NO:33 | SEQ ID NO:71
PCTUC_629 | (monocarboxylate transporter) (MCT3) | NM_004207 | SEQ ID NO:29 | SEQ ID NO:67
PCTUC_722 | (integrin beta(4) subunit) | NM_000213 | SEQ ID NO:23 | SEQ ID NO:61
PCTUC_748 | (T245 protein, T245) | NM_003270 | SEQ ID NO:25 | SEQ ID NO:63
PCTUC_784 | (membrane cofactor protein, CD46, MCP) | NM_002389 | SEQ ID NO:34 | SEQ ID NO:72
PCTUC_812 | (protocadherin-9) (PCDH9) | NM_020403 | SEQ ID NO:28 | SEQ ID NO:66
PCTUC_856 | (extraneuronal monoamine transporter) | NM_021977 | SEQ ID NO:11 | SEQ ID NO:49
PCTUC_898 | (KIAA0792) | NM_014698 | SEQ ID NO:5 | SEQ ID NO:43
PCTUC_935 | (voltage gated potassium channel, KCNQ1) | NM_000218 | SEQ ID NO:32 | SEQ ID NO:70
PCTUC_936 | (FGFR3) | NM_000142 | SEQ ID NO:4 | SEQ ID NO:42
PCTUC_986 | (senescence-associated epithelial membrane protein, SEMP1) | NM_021101 | SEQ ID NO:9 | SEQ ID NO:47
PCTUC_991 | (osteoblast specific factor 2, OSF-2OS) | NM_006475 | SEQ ID NO:37 | SEQ ID NO:75
PCTUC_992 | (NRAMP2) | NM_000617 | SEQ ID NO:22 | SEQ ID NO:60
PCTUC_1054 | (polymorphic epithelial mucin) (PEM) | NM_002456 | SEQ ID NO:21 | SEQ ID NO:59
PCTUC_1061 | (KIAA0779) | AB018322 | SEQ ID NO:17 | SEQ ID NO:55
PCTUC_1073 | (decay accelerating factor for complement, CD55, DAF) | NM_000574 | SEQ ID NO:18 | SEQ ID NO:56
PCTUC_1075 | (solute carrier family 6 member 6, neurotransmitter transporter, taurine) | NM_003043 | SEQ ID NO:35 | SEQ ID NO:73
PCTUC_1078 | (osteopontin) | NM_000582 | SEQ ID NO:30 | SEQ ID NO:68
PCTUC_1082 | (claudin 2) | NM_020384 | SEQ ID NO:16 | SEQ ID NO:54
PCTUC_1122 | (hepatoma transmembrane kinase ligand, HTK ligand) | NM_004093 | SEQ ID NO:24 | SEQ ID NO:62


[0098] The following gene descriptions are shown as representative of the types
of genes discovered in the present invention by the process described above. Since
some of the genes, such as SEQ ID NO:1; SEQ ID NO:18; SEQ ID NO:21; SEQ ID
NO:23; SEQ ID NO:24; SEQ ID NO:27; and SEQ ID NO:30, are currently known to be
associated with colon cancer, the identification of these genes by this method
serves to validate the approach.

Description of representative overexpressed genes found in human colon tumors.
Opiate (opioid) binding-cell adhesion molecule (OBCAM) 

[0099] OBCAM, first identified by Cho et al., Proc Natl Acad Sci 80: 5176-80,
(1983), was characterized as similar to other cell-adhesion molecules and named
OBCAM by Schofield et al., EMBO J Feb 8(2): 489-95, (1989). The gene was cloned
and sequenced from the human brain by Shark and Lee, Gene 155: 213-17, (1995). It
was shown to be a neuron specific protein, presumed to play a role as a cell
adhesion/recognition molecule, but its function has not been fully elucidated. Indirect
evidence indicates a role for this protein in opioid function. Studies have suggested
that the function of OBCAM in the brain may involve axonal outgrowth.

SEMP1 (Claudin 1; CLDN1; Senescence-associated epithelial membrane protein 1)

[00100] Swisshelm et al., Gene 226: 285-95, (1999) isolated cDNAs encoding
SEMP1 from human mammary epithelial cells. The mRNA was shown to be
expressed in human tissues, including adult and fetal liver, pancreas, placenta,
adrenals, prostate and ovary, but at low or undetectable levels in a number of human
breast cancer cell lines. It is a member of a superfamily of epithelial membrane
proteins (EMPs), which may have multiple potential functions, including maintenance
and regulation of cell polarity and permeability, perhaps through mechanisms
involving tight junctions (ibid). SEMP1 was first identified by Furuse et al., J Cell
Biol 141: 1539-50, (1998). Its expression in normal human mammary epithelial cells,
in contrast to low or undetectable levels of expression in a number of breast tumors
and breast cancer cell lines, may indicate that the gene is a possible tumor suppressor
(Kramer et al., Hum Genet 107: 249-56, (2000)). Expression of this protein and tight
junction morphology were altered in blood vessels of human glioblastoma
multiforme (Liebner et al., Acta Neuropathol 100 (3): 323-31, (2000)).

Neuroligin 1 (NLGN1)

[00101] NLGN1 was first identified as a neuronal cell surface protein and
characterized as a splice site-specific ligand for beta-neurexins by Ichtchenko et al.,
Cell 81: 435-43, (1995). Neuroligin 1 binds to beta-neurexins only if they lack an
insert in the alternatively spliced sequence of the G domain, not if they contain an
insert. Findings support a model whereby alternative splicing of neurexins creates a
family of cell surface receptors that confer interactive specificity on their resident
neurons (ibid). It was also suggested that these proteins are part of the machinery
employed during the formation and remodeling of CNS synapses (Scheiffele et al.,
Cell 101: 657-69, (2000)). Neuroligin 1 was cloned by Kikuno et al., DNA Res 6:
197-205, (1999). Neuroligin 1 expression in mammary carcinomas may reflect
deregulated gene expression. As carcinomas regressed in a study by Ariazi et al.,
J Biol Chem 271: 29286-94, (1996), neuroligin 1 expression was repressed or turned
off, which is consistent with regulated gene expression in most normal tissue.

Extraneuronal monoamine transporter (EMT) 

[00102] EMTs, polyspecific organic cation transporters in the liver, kidney, and
intestine, are critical for the elimination of many endogenous amines as well as a wide
array of drugs and environmental toxins. This gene was cloned from a human kidney
carcinoma cell line by Grundemann et al., Nature Neurosci 1: 349-51, (1998).
Northern blot analysis detected high level expression of the gene transcript in
first-trimester and term placenta, skeletal muscle, prostate, aorta, liver, fetal lung,
salivary gland, and adrenal gland. Moderate to low expression was detected in uterus,
ovary, kidney, lymph node, lung, trachea, and fetal liver (Verhaagh et al., Genomics 55:
209-18, (1999)). Wu et al., J. Biol Chem 273: 32776-86, (1998) suggested that this
protein plays a significant role in the disposition of cationic neurotoxins and
neurotransmitters in the brain. The first report of the extraneuronal monoamine
transporter (uptake 2) was by Streich et al., Naunyn Schmiedebergs Arch Pharmacol
353(3): 328-33, (1996). Streich et al. reported that human glioma cells express this
transporter and thus, glial cells in the human CNS endowed with this transporter are
likely to contribute to the inactivation of neuronally released noradrenaline.

COLNOV1 (Homo sapiens Hypothetical Protein MBC3205)

[00103] COLNOV1 was identified in The Cancer Genome Anatomy Project
Database by the National Cancer Institute. It was cloned and submitted by Robert L.
Strausberg to GenBank in September 2001 with no journal publication cited. Its
function is not yet elucidated.

STEAP1

[00104] STEAP1 is a six transmembrane epithelial antigen of the prostate. The
gene is expressed predominantly in human prostate tissue and is up-regulated in
multiple cancer cell lines, including prostate, bladder, colon, ovarian, and Ewing
sarcoma (Hubert et al., Proc Natl Acad Sci 96: 14523-28, (1999)). Study results
support STEAP as a cell-surface tumor-antigen target for prostate cancer therapy and
diagnostic imaging (ibid). The structure of the protein suggests a potential function as
a channel or transporter protein. STEAP1 was first reported and cloned by Hubert et
al. (1999).

Claudin 2 

[00105] Claudin 2 was first identified from chicken liver by Furuse et al., J
Cell Biol 141: 1539-50, (1998). It is an integral membrane protein localized at tight
junctions. Research has concluded that claudins regulate the intestinal barrier in
response to immune mediators (Kinugasa et al., Gastroenterology 118(6): 1001-11,
(2000)). Although it is not clear, it has been speculated that this protein, as well as
other tight junction proteins, may play a role in the molecular mechanism of brain
tumor edema (Papadopoulos et al., Br J Neurosurg 15(2): 101-8, (2001)). Furuse et
al., J Cell Biol 153(2): 263-272, (2001) cloned the canine equivalent.

KIAA0779

[00106] KIAA0779 was first identified and cloned from human brain by
Nagase et al., DNA Res 5(5): 277-86, (1998). It may be functionally related to cell
signaling/communication, cell structure/motility and nucleic acid management (ibid).

Decay accelerating factor for complement (CD55, DAF)

[00107] CD55 is a 70-kD glycoprotein that helps host tissues avoid attack by
autologous complement proteins. DAF interrupts the complement sequence at an
early step in activation, effectively halts progression of the cascade and prevents
consequent cell injury. DAF is expressed on the plasma membrane of all cell types
that are in intimate contact with plasma complement proteins (Koretz et al., Br J
Cancer 66: 810-814, (1992)). Cloned and characterized by Medof et al., Proc Nat
Acad Sci 84: 2007-11, (1987), DAF can also act as a signal-transducing molecule.
With monoclonal antibodies directed against DAF, human monocytes can be
activated in vitro (Shibuya et al., J Immunol 149: 1758-62, (1992)). Hensel et al.,
Laboratory Investigation 81: 1553-63, (2001) have isolated a human antibody, SC-1,
from a patient with a signet ring cell carcinoma of the stomach; the antibody induced
apoptosis of gastric cancer cells in vitro and is being used successfully in clinical
trials (Vollmers, et al., Oncol Rep 5: 549-52, (1988)). The target for the antibody
was discovered to be a modified DAF protein (Hensel, et al., (2001)). DAF is
overexpressed on various tumors such as breast, colon, and stomach carcinoma (Koretz
et al., (1992)). Nowicki et al., Am J Reprod Immunol 46(2): 144-8, (2001), have
reported that expression of DAF in endometrial adenocarcinoma is inversely related
to the stage of tumor. This is consistent with the hypothesis that early stage
endometrial adenocarcinoma that is exposed to complement attack may up-regulate
DAF to protect malignant cells from complement lysis.

OATPRP1 (solute carrier family 21 (organic anion transporter), member 12)

[00108] OATPRP1 was submitted to GenBank November 16, 1999 by Wu et
al., (unpublished) as part of identification and characterization of novel human OATP
family members.

NKCC1 (solute carrier family 12, member 2; SLC12A2)

[00109] The Na-K-Cl cotransporter, NKCC1, aids transcellular movement of
chloride across both secretory and absorptive epithelia. It is reportedly expressed in
many tissues, including the basolateral membrane of secretory epithelia where it
mediates active chloride secretion (Quaggin et al., Mamm Genome 6 (8): 557-8,
(1995)). Specifically, this protein is a bumetanide-sensitive Na-K-Cl cotransporter
first discovered by Xu et al., Proc Natl Acad Sci 91: 2201-05, (1994). The gene was
cloned and characterized by Payne et al., J Biol Chem 270: 17977-85, (1995). It has
been proposed that the NKCC1 gene is involved in the control of normal cell
proliferation, while its overexpression results in apparent cell transformation, in a
manner similar to some protooncogenes (Panet et al., J Cell Phys 182: 109-18, (2000)).

PEM (polymorphic epithelial mucin: mucin L MUCl: peanut-reactive urinarv mucin: 
PUM Mucin: tumor-associated epithelial PEM) 

[00110] PEM is a large cell surface mucin glycoprotein expressed by most 
glandular and ductal epithelial cells and some hematopoietic cell lineages. It has a 
tandem repeat domain that is highly O-glycosylated and alterations in glycosylation 
have been shown in epithelial cancer cells. PEM was first identified by Karlsson et 
al., (Ann Hum Genet 47: 263-9, (1983)) as a polymorphism identified from human
urinary mucin by SDS polyacrylamide gel electrophoresis followed by detection with 
radiolabelled lectins. Peanut agglutinin was the most effective lectin; hence one of
the proposed names. It is expressed in other normal and malignant tissues of
epithelial origin including the mammary gland. This protein is developmentally 
regulated and aberrantly expressed in breast cancer, (Gendler et al., J Biol Chem 265:
15286-93, (1990)). Individuals with small PEM alleles/genotypes have an increased
risk for development of gastric carcinoma, (Silva, et al., Eur J Hum Genet 9(7): 548-52,
(2001)). PEM is activated in B-cell lymphoma by translocation and is rearranged
and amplified in B-cell lymphoma subsets, (Dyomin, et al., Blood 95: 2666-71, 
(2000)). PEM was cloned and characterized by Gendler, et al., (1990). 

NRAMP2 (natural resistance-associated macrophage protein 2; divalent cation
transporter 1; DCT1; divalent metal transporter 1; DMT1; solute carrier family 11
(proton-coupled divalent metal ion transporter), member 2; SLC11A2)
[00111] NRAMP2 was first identified in the mouse genome by Gruenheid et
al., Genomics 25: 514-25, (1995). Human NRAMP2 was isolated and characterized
by Vidal et al., Mammalian Genome 6: 224-30, (1995). Its function is mediation of
active transport that is proton-coupled and dependent on the cell membrane potential.
It is up regulated by dietary iron deficiency and may represent a key mediator of
intestinal iron absorption, (Gunshin et al., Nature 388: 482-88, (1997); Tandy, et al.,
J Biol Chem 275(2): 1023-29, (2000)).

Integrin, Beta-4 (ITGB4) 

[00112] Integrin beta 4 was first identified, characterized, and cloned by
Suzuki, et al., EMBO J 9(3):757-63, (1990). Integrins are transmembrane
glycoprotein receptors that mediate cell-matrix or cell-cell adhesion, and transduce
signals that regulate gene expression and cell growth, (Vidal et al., Nature Genet
10:229-34, (1995)). Integrins are heterodimeric molecules that consist of 
noncovalently linked alpha and beta subunits. Alpha-6/beta-4 is restricted to the 
ventral surface opposed to the basal membrane zone, suggestive of its role in cell- 
matrix adhesion. Consistent with this possibility, alpha-6/beta-4 is found to be 
associated with the hemidesmosomes in stratified and transitional epithelia, (ibid). 
The cytoplasmic domain of the integrin beta-4 subunit mediates both association with 
the hemidesmosomal cytoskeleton and recruitment of the signaling adaptor protein 
SHC. This integrin is a receptor for the laminins, and likely plays a role in invasive
carcinomas. Shaw et al., Cell 91: 949-60, (1997) demonstrated in the MDA-MB-435
breast carcinoma cell line, that the alpha-6/beta-4 integrin promotes carcinoma
invasion through a preferential and localized targeting of phosphoinositide 3-OH
kinase (PI3K) activity. Expression of alpha-6/beta-4 integrin has been shown to
increase during malignant conversion of mouse epidermal keratinocytes, (Gomez et
al., Exp Cell Res 201: 250-61, (1992)).

HTK ligand (hepatoma transmembrane kinase ligand; EPH-related receptor tyrosine
kinase ligand 5; EPLG5; Ligand of EPH-related kinase 5; LERK5; HTKL; EPHRIN
B2; EFNB2)

[00113] EPHRIN B2 was first reported by Carpenter, et al., J Neurosci Res
42(2): 199-206, (1995) and cloned from hematopoietic monocytic lineage by Bennett
et al., Proc Natl Acad Sci 92(6): 1866-70, (1995). Studies by Sakano, et al.,
Oncogene 13(4): 813-22, (1996) suggest the involvement of the HTK-HTKL system
in the proliferation of HTK+ hematopoietic progenitor cells in the hematopoietic 
environment. According to Bennett et al., (1995), this ligand and its receptor are 
widely expressed and may function in a variety of tissues. HTK ligand is reportedly 
expressed in adult lung and kidney and in fetal heart, lung, kidney, and brain, (Cerretti 
et al., Immun 32: 1197-1205, (1995)). Unlike most of its family members, the
ligand's receptor does not appear to be expressed in the CNS. However, similar to
other members, it is expressed in primary epithelia and epithelial cell-derived lines, 
(Bennett et al., (1995)). The HTK-HTKL interaction may represent an early step in 
the initiation of synapse formation or maturation and may potentiate the ability of the 
receptor to respond to activity-dependent signals from the extracellular milieu, 
(Takasu, et al., Science 295: 491-95, (2002)). It is reported that HTK ligand
transcripts are highly expressed in several small cell lung carcinoma cell lines and
may modulate the biological behavior of these cells through autocrine and/or 
juxtacrine activation, (Tang et al., Hum Mol Genet 4: 2033-45, (1995)). Increased 
HTKL expression possibly reflects or induces an increased potential for growth, 
tumorigenicity, and metastatic abilities in human melanomas. The HTK-HTKL 
system could be a potential new source for molecular markers as well as a target for 
new therapies, (Vogt, et al., Clin Cancer Res 4(3): 791-7, (1998)). Studies by Liu et
al., Cancer 94(4): 934-9, (2002) demonstrate that the ligand is expressed
differentially in colon carcinoma and normal mucosa specimens and thus may play a
role in the progression of colon carcinoma. 

T245 protein (T245; TM4SF6; transmembrane 4 superfamily member 6)
[00114] Several TM4SF proteins have been shown to stimulate or modulate
cell growth, (Bell et al., J Exp Med 175: 527-536, (1992); Higashiyama et al., J Cell
Biol 128: 929-938, (1995); Lebel-Binay et al., J Immunol 155: 101-110, (1995);
Maecker et al., FASEB J 11: 428-442, (1997)). Some may associate with integrins
and control cell adhesion and movement, (Hadjiargyrou et al., J Neurochem 67:
2505-2513, (1996); Hemler et al., Biochem Biophys Acta 1287: 67-71, (1996);
Maecker et al., (1997)). TM4SF6 was identified, cloned, and characterized from a
human glioma by Maeda et al., Genomics 52: 240-242, (1998). Gene expression is
reportedly widely found in human adult tissues. Among the tissues with high-level
expression are the liver, pancreas, kidney, and ovary, while low-level expression was
observed in the skeletal muscle, lung, and brain, (Maeda et al., (1998)).

Carcinoembryonic antigen-related cell adhesion molecule 5 (CEACAM5)
[00115] CEACAM5 was first described by Gold and Freedman, J Exp Med
121: 439-62, (1965) as a complex immunoreactive glycoprotein comprised of 60%
carbohydrate. It is found in adenocarcinomas of endodermally derived digestive 
system epithelia and in fetal colon. The CEA immunoassay is useful in the diagnosis 
and serial monitoring of cancer patients for recurrent disease or response to therapy, 
particularly in colon cancer. CEACAM5 was isolated and characterized by 
Zimmermann et al., Proc Nat Acad Sci 84: 2960-64, (1987). The CEA gene was 
renamed CEACAM5, (Beauchemin et al., Exp Cell Res 252: 243-49, (1999)). 

Protocadherin-9 (PCDH9)

[00116] Protocadherin-9 is a member of the cadherin superfamily.
Protocadherins are a subfamily of calcium-dependent cell adhesion and recognition
proteins. PCDH9 was first isolated and identified by Strehl et al., Genomics 53: 81-89,
(1998) from a human fetal brain library. It is closely related to PCDH1 and
PCDH7. Like other protocadherins, PCDH9 is predominantly expressed in fetal and
adult brain. It has a developmentally regulated expression pattern in brain that
suggests this protein directs various aspects of morphogenesis, (Strehl et al., (1998)).

Monocarboxylate transporter (MCT3)

[00117] The function of the proton-linked monocarboxylate transporter family 

is transport of substances across the plasma membrane. For example, lactic acid and 
pyruvate are transported via members of this family. Each member of the family 
appears to have slightly different substrate and inhibitor specificities and transport 
kinetics, which are related to the metabolic requirements of the tissues in which they 
are found. MCT3 was identified, cloned, and sequenced by Price et al., Biochem J
329: 321-28, (1998). MCT3 appears to be the MCT isoform expressed most in
muscle fibers where energy metabolism is mainly glycolytic. It is a major route for 
lactic acid efflux from skeletal muscle and other cells, (Wilson, et al., J Biol Chem 
273: 15920-26, (1998)). MCT3 is also found in retinal pigment epithelia, where it 
transports lactate between two tissue compartments, the interphotoreceptor matrix and 
the choriocapillaris, (Yoon et al., Biochem Biophys Res Comm 234: 90-94, (1997)).

Osteopontin (secreted phosphoprotein 1; SPP1; OPN; bone sialoprotein; urinary stone
protein)

[00118] Osteopontin is the principal phosphorylated glycoprotein of bone and
is expressed in a limited number of other tissues including dentine, (Crosby et al.,
Genomics 27: 155-160, (1995)). Osteopontin is produced by osteoblasts under
stimulation by calcitriol and binds tightly to hydroxyapatite. It has been shown to be
involved in the anchoring of osteoclasts to the mineral of bone matrix, (Reinholt et al.,
Proc Nat Acad Sci 87: 4473-75, (1990)). Urinary calcium oxalate stones also contain
this protein, (Kohri et al., J Biol Chem 268: 15180-84, (1993)). Osteopontin is a
constitutive component of normal elastic fibers in human skin and aorta. It has been
suggested that it plays a role in modulating crystal nucleation and growth in
mineralizing tissues and, more generally, in conditions in which mineral precipitation
should be controlled, (Baccarini-Contri et al., Matrix Biol 14: 553-560, (1994)). This
protein has been implicated in a number of pathologies, including but not limited to
breast, colon, prostate, and lung cancers. There is evidence that osteopontin enhances
malignancy, and that signaling pathways directly induced by this protein, as well as
interactions with growth factor receptor pathways, can combine to activate expression
of genes and functions that contribute to metastasis, (Furger et al., Curr Mol Med
1(5): 621-32, (2001)). Recent clinical evidence also suggests that osteopontin levels
in cancer patients' blood or tumors may help provide prognostic information. The
earliest report of this protein was by Oldberg, et al., Proc Natl Acad Sci 83 (23):
8819-23 (1986), who cloned and sequenced osteopontin from a rat. The human
gene was cloned and sequenced by Kiefer et al., Nucleic Acids Res 17: 3306, (1989).

Voltage-gated potassium channel, KQT-like subfamily, member 1 (KCNQ1)
[00119] KCNQ1 is a member of the largest and most diversified class of ion
channels. The main functions of this class are associated with the regulation of the resting
membrane potential and the control of the shape and frequency of action potentials.
These channels are made up of multimeric proteins containing alpha subunits, which
are directly responsible for channel activity, and gamma subunits, which modify the
basic channel activity. KCNQ1 was first identified, cloned, and characterized by Wang
et al., Nat Genet 12 (1): 17-23, (1996). This specific protein is essential for the
repolarization phase of the cardiac action potential and for K+ homeostasis in the
inner ear, (Neyroud et al., Circ Res 84: 290-97, (1999)). Northern blot analyses and
in situ hybridization indicate that KCNQ1 is expressed in kidney, pancreas, lung,
placenta, inner ear, and heart, with the highest level of expression in heart, (Sanguinetti et
al., Nature 384: 80-83, (1996); Wang et al., Nat Genet 12: 17-23, (1996); Neyroud et
al., Nat Genet 15: 186-89, (1997)). This gene is located on 11p15.5, in a large
domain of contiguous genes abnormally imprinted in cancer and in Beckwith-Wiedemann
syndrome, (Lee et al., Nat Genet 15: 181-85, (1997)). Mutations in
KCNQ1 are the most frequent cause of the long-QT syndrome (LQTS), which is an
inherited cardiac disorder that predisposes individuals to syncope, seizures, and
sudden cardiac death from ventricular tachyarrhythmias, (Schwartz, et al., Am Heart J
89: 378-90, (1975)).

Epican 

[00120] Epican is a heparan sulfate proteoglycan. This protein has a novel 339
amino acid domain inserted into the proximal extracellular domain of the standard,
leukocyte form of CD44. It was first identified and characterized as a proteoglycan
on human keratinocytes by Haggerty et al., J Invest Dermatol 99 (4): 374-80, (1992)
and given the name epican, which stands for epidermal intercellular proteoglycan.
Epican is expressed on the outer cell surface of squamous-cell carcinomas and can be
a target for other tumors as well, (Van Hal, et al., Int J Cancer 68 (4): 520-27,
(1996)).



Membrane cofactor protein (CD46; MCP; measles virus receptor)
[00121] The level of complement activation on cell membranes is regulated by

the expression of membrane-bound complement regulatory proteins, which protect 
normal and tumor cells from uncontrolled complement-mediated injury, (Gorter and 
Meri, Immunol Today 20: 576-582, (1999)). Membrane cofactor protein is one of 
these regulatory proteins. Several studies have shown that in situ tumor cells 
overexpress CD46, (Koretz et al., Br J Cancer 66: 810-814, (1992); Li et al., Br J
Cancer 84: 80-86, (2001); Maenpaa et al., Am J Pathol 148: 1139-52, (1996);
Niehans et al., Am J Pathol 149: 129-142, (1996); Yamakawa et al., Cancer 73:
2808-17, (1994)). The overexpression of this and other regulatory proteins of
complement on tumor cells may prevent an efficient local immune response. 
According to Durrant and Spendlove in Curr Opin Investig Drugs 7:959-66, (2001), 
expression of the complement-regulatory proteins CD55, CD46 and CD59 is
deregulated in cancer with tumors showing loss of one or more inhibitors and strong 
overexpression of others. This results in tumors that are resistant to attack by 
complement. Studies performed in vitro and in vivo have shown that tumor sensitivity 
to complement can be restored by co-administration of antibodies that bind to the 
functional domains of complement-regulatory proteins, (Durrant and Spendlove, 
(2001)). Sparrow et al., Hum Immunol 13: 83-93, (1985) first reported CD46 as 
HuLy-m5. Purcell et al., Immunogenetics 33: 335-44, (1991) isolated and cloned this 
protein. 



Solute carrier family 6 member 6 (neurotransmitter transporter, taurine; SLC6A6)
[00122] Taurine is a major intracellular amino acid in mammals. It is involved
in bile acid conjugation in hepatocytes, modulation of calcium flux and neural
excitability, osmoregulation, detoxification, and membrane stabilization. The taurine
transporter (SLC6A6) has considerable amino acid sequence similarity to sodium- and
chloride-dependent transporters, (Uchida et al., Proc Nat Acad Sci 89: 8230-34,
(1992)). Ramamoorthy et al., Biochem J 300: 893-900, (1994) cloned and
characterized this gene from human placenta. Northern blot analysis by
Ramamoorthy et al., (1994) revealed that the principal transcript is expressed
abundantly in placenta and skeletal muscle, at intermediate levels in heart, brain, lung,
kidney, and pancreas, and at low levels in liver. Cultured human cell lines from
placenta, intestine, cervix, and retinal pigment epithelia also contain the transcript. 

Osteoblast specific factor 2, OSF-2os; periostin

[00123] This protein shares structural and sequence homology with fasciclin I,
which is an insect adhesion molecule, (Takeshita et al., Biochem J 294: 271-278,
(1993)). There is substantial evidence to suggest that cell-cell and cell-matrix
adhesive interactions play a crucial role in tumorigenesis, tumor progression, and in
particular metastasis, (Albelda, Lab Invest 68: 4-17, (1993); Tuszynski, et al., Acta
Haematol 97: 29-39, (1997)). It has been observed that periostin was overexpressed
ten-fold in glioblastoma compared to normal brain tissue and the gene is
overexpressed in several human tumors including carcinomas of ovary, breast and
brain, (Lal et al., Cancer Res 59: 5403-07, (1999)). This protein is also
overexpressed in ovarian tumors, (Ismail et al., Cancer Res 60: 6744-49, (2000)).
Results by Sasaki et al., Cancer Letters 172: 37-42, (2001) show, by in situ RNA
hybridization, that the periostin gene is highly expressed in the surrounding stromal
cells of breast and lung cancer tissue but not within the tumor, suggesting that
expression of periostin may be involved in tumor invasion. This gene was first cloned
and characterized by Takeshita et al., (1993). Periostin is expressed in bone and to a
lesser extent in the lung, but not other tissues.

CEACAM8 (CGM6, CD66b, NCA-95, NCA-W272) 

[00124] CEACAM8 is a member of the CEA gene family of cell adhesion 

molecules, which is primarily expressed in neutrophils and eosinophils (Eades-Perner
et al., (1998) Blood 91: 663-672). Two groups reported the cloning of the gene for
CEACAM8 in 1990. Berling et al. ((1990) Cancer Res 50:6534-6539) cloned the
gene from a library constructed from a CML patient and Arakawa et al. ((1990)
BBRC 166: 1063-1071) identified the same gene from a library prepared from normal
human peripheral white blood cells. 

FGFR3 

[00125] Fibroblast Growth Factor Receptor 3 ("FGFR3") is a member of the
fibroblast growth factor receptor ("FGFR") family of the receptor tyrosine kinase
proteins (RTKs). Fibroblast growth factor receptors mediate growth, differentiation
and cellular migration in a diverse range of cell types. Hart, K. C., et al., Mol. Biol.
Cell, 12:931-941, 2001. Fibroblast growth factor ("FGF") signaling plays a role in
mitogenesis, mesoderm induction, neuronal survival and neuritic extension, tumor
angiogenesis, and atherosclerosis, Pandit, S. G., et al., Biochem. J., 361:231-241,
2002. Ligand binding to the extracellular domain of the FGFR receptors induces
receptor dimerization and transphosphorylation of tyrosine residues in the
intracellular domains of the receptors. Links between FGFR3 and cancer have recently
been uncovered (Hart, K. C., et al., Mol. Biol. Cell, 12:931-941, 2001; Pandit, S. G.,
et al., Biochem. J., 361:231-241, 2002). In 1990, FGFR3 was reported by Elena B.
Pasquale as cek2, one of two novel kinases present in chicken embryos that were
homologous to cek1, a chicken fibroblast growth factor receptor, Pasquale, E. B.,
Proc. Natl. Acad. Sci. USA, 87:5812-5816, 1990. In 1991, Keegan et al. (Proc. Natl.
Acad. Sci. USA, 88:1095-1099, 1991) reported the isolation of the gene. The cDNA was
sequenced in its entirety and designated as Fibroblast Growth Factor Receptor 3
(FGFR3). Anti-FGFR antibodies were first reported at least as early as 1991 (Bellot
et al., EMBO J., 10:2849-2854, 1991). In 1991, Keegan et al. (Oncogene, 6:2229-2236,
1991) attempted to create an antibody specific for FGFR3 but encountered
significant difficulties in obtaining an antibody that did not cross react with other
FGFR receptors.

GPR56 (TM7XN1, TM7LN4, EGF TM7-like cDNA) 

[00126] GPR56, also known as TM7XN1, TM7LN4 or as EGF TM7-like
cDNA, is a novel G-protein coupled receptor (GPR) independently identified by two
separate groups of research scientists in 1999 (Liu, M., et al., Genomics, 55:296-305,
1999; Zendman, A. J. W., et al., FEBS Letters, 446:292-298, 1999). GPR56, like
other G-protein coupled receptors, is dominated by a seven transmembrane (TM7)
domain consisting of seven hydrophobic stretches of amino acids, each forming a
transmembrane domain that traverses the plasma membrane of the cell. These
domains are highly conserved among the various G-protein coupled receptors. The
function of orphan G-protein coupled receptors remains unidentified. However, the
presence of mucin-like and/or EGF-like domains suggests that GPR56 is involved in
cell-cell binding. While GPR56 lacks the EGF domains, reports suggest that it may
be involved in cell-cell adhesion through glycosylation of its N-terminal domain.

P Cadherin (placental cadherin, pcad, cadherin-3, cdh3 or cdhp)

[00127] Cadherins are a superfamily of transmembrane proteins regulating cell-cell
adhesion during development and tissue homeostasis (Gumbiner, B.M., J. Cell
Biol., 148:399-404, 2000; Yagi, T. and Takeichi, M., Genes Dev., 14:1169-1180,
2000). Cadherins have five extracellular Ca2+ binding domains and a small
cytoplasmic domain that is highly conserved among the classical cadherins. P-cadherin
expression is often altered in only a subset of a given type of tumor. It is
known that P-cadherin is upregulated in inflammatory bowel diseases such as Crohn's
disease and colitis. Aberrant P-cadherin expression has been associated with cell
proliferation and dedifferentiation in breast cancer (Gamallo, C., Modern Pathology,
14:650-654, 2001). Human P-cadherin was reported to be the antigen recognized by
the NCC-CAD-299 monoclonal antibody raised against a vulvar epidermoid
carcinoma (Shimoyama, Y., et al., Cancer Res., 49:2128-2133, 1989). The human
gene was isolated by Shimoyama, Y., et al. (J. Cell Biol., 109:1787-1794, 1989).

RON kinase 

[00128] RON (Recepteur d'Origine Nantais) is a receptor protein tyrosine 

kinase belonging to the hepatocyte growth factor ("HGF") receptor family. Protein 
tyrosine kinases are enzymes that transfer the terminal phosphate of adenosine 
triphosphate (ATP) to a specific tyrosine residue on a target protein. These enzymes 
are found in all multicellular organisms and play a central role in the regulation of 
cellular growth and in the differentiation of complex eukaryotes. There are two major 
classes of tyrosine kinases: transmembrane receptor tyrosine kinases and non-receptor 
tyrosine kinases. Transmembrane receptor tyrosine kinases are activated directly by 
binding of peptide growth factors and cytokines to their extracellular domains.
Tyrosine kinases that fall within this class include the receptor for hepatocyte growth
factor. The normal function of the receptor is to act as a transducer of extracellular
signals. The Ron gene encodes a 190 kDa protein, the mature form being a disulfide-linked
heterodimer. RON comprises a 40-kD extracellular α chain and a 150-kD β
chain having an intracellular protein tyrosine kinase domain, the activity of which is 
increased by ligand-receptor binding (Leonard, E.J. and Danilkovitch, A., Advances in 
Cancer Research, 2000, 139-165). 

KIAA0792 

[00129] KIAA0792 is a gene of unknown function, primarily expressed in the 

kidney, brain, ovary, lung and pancreas. The KIAA0792 transcript has been mapped 
to human chromosome 1 at position 1q42.13. The KIAA0792 locus contains 25
exons spread over 36,774 base pairs of genomic DNA. KIAA0792 consists of a 4074
nucleotide sequence encoding a protein of 807 amino acids. The sequences are
available in GenBank, Accession Number AB018335. The protein encoded by
KIAA0792 may contain as many as ten transmembrane domains, positioned at amino
acids 67-89, 166-188, 212-234, 446-468, 487-509, 528-550, 583-605, 635-657, 689-711,
and 717-739. In addition to these transmembrane domains, at least one other
recognized domain, DUF221, is located at position numbers 356-806. This domain is 
present in a number of other putative transmembrane proteins of unknown function. 

[00130] The present invention discloses genes that have not previously been
reported to be over-expressed in human tumor tissue compared to corresponding
normal tissue: SEQ ID NO:5; SEQ ID NO:8; SEQ ID NO:9; SEQ ID NO:16; SEQ ID
NO:17; SEQ ID NO:19; SEQ ID NO:20; SEQ ID NO:22; SEQ ID NO:25; SEQ ID
NO:26; SEQ ID NO:28; SEQ ID NO:29; SEQ ID NO:32; SEQ ID NO:35; and SEQ
ID NO:3113.

[00131] The present invention also discloses genes that have previously been
reported to be over-expressed in human tumor tissue but not specifically in colon
tumors: SEQ ID NO:2; SEQ ID NO:3; SEQ ID NO:4; SEQ ID NO:10; SEQ ID
NO:11; SEQ ID NO:15; SEQ ID NO:33; SEQ ID NO:34; and SEQ ID NO:37.

Polypeptides

[00132] Another embodiment of the invention is an isolated polypeptide 

comprising an amino acid sequence selected from the group consisting of: 

(a) a mature form of an amino acid sequence selected from the group consisting of 
SEQ ID NO:39; SEQ ID NO:40; SEQ ID NO:41; SEQ ID NO:42; SEQ ID NO:43; 
SEQ ID NO:46; SEQ ID NO:47; SEQ ID NO:48; SEQ ID NO:49; SEQ ID NO:53; 
SEQ ID NO:54; SEQ ID NO:55; SEQ ID NO:56; SEQ ID NO:57; SEQ ID NO:58; 
SEQ ID NO:59; SEQ ID NO:60; SEQ ID NO:61; SEQ ID NO:62; SEQ ID NO:63;
SEQ ID NO:64; SEQ ID NO:65; SEQ ID NO:66; SEQ ID NO:67; SEQ ID NO:68;
SEQ ID NO:70; SEQ ID NO:71; SEQ ID NO:72; SEQ ID NO:73; SEQ ID NO:74;
SEQ ID NO:75; and SEQ ID NO:3114;

(b) a variant of a mature form of an amino acid sequence selected from 
the group consisting of SEQ ID NO:39; SEQ ID NO:40; SEQ ID NO:41; SEQ 
ID NO:42; SEQ ID NO:43; SEQ ID NO:46; SEQ ID NO:47; SEQ ID NO:48; 
SEQ ID NO:49; SEQ ID NO:53; SEQ ID NO:54; SEQ ID NO:55; SEQ ID 
NO:56; SEQ ID NO:57; SEQ ID NO:58; SEQ ID NO:59; SEQ ID NO:60; 
SEQ ID NO:61; SEQ ID NO:62; SEQ ID NO:63; SEQ ID NO:64; SEQ ID 
NO:65; SEQ ID NO:66; SEQ ID NO:67; SEQ ID NO:68; SEQ ID NO:70; 
SEQ ID NO:71; SEQ ID NO:72; SEQ ID NO:73; SEQ ID NO:74; SEQ ID 
NO:75; and SEQ ID NO:3114, wherein one or more amino acid residues in
said variant differs from the amino acid sequence of said mature form,
provided that said variant differs in no more than 20% of the amino acid
residues from the amino acid sequence of said mature form;

(c) an amino acid sequence comprising a sequence selected from the
group consisting of SEQ ID NO:39; SEQ ID NO:40; SEQ ID NO:41; SEQ ID
NO:42; SEQ ID NO:43; SEQ ID NO:46; SEQ ID NO:47; SEQ ID NO:48;
SEQ ID NO:49; SEQ ID NO:53; SEQ ID NO:54; SEQ ID NO:55; SEQ ID
NO:56; SEQ ID NO:57; SEQ ID NO:58; SEQ ID NO:59; SEQ ID NO:60;
SEQ ID NO:61; SEQ ID NO:62; SEQ ID NO:63; SEQ ID NO:64; SEQ ID
NO:65; SEQ ID NO:66; SEQ ID NO:67; SEQ ID NO:68; SEQ ID NO:70;
SEQ ID NO:71; SEQ ID NO:72; SEQ ID NO:73; SEQ ID NO:74; SEQ ID
NO:75; and SEQ ID NO:3114; and

(d) a variant of an amino acid sequence comprising a sequence selected
from the group consisting of SEQ ID NO:39; SEQ ID NO:40; SEQ ID NO:41;
SEQ ID NO:42; SEQ ID NO:43; SEQ ID NO:46; SEQ ID NO:47; SEQ ID
NO:48; SEQ ID NO:49; SEQ ID NO:53; SEQ ID NO:54; SEQ ID NO:55;
SEQ ID NO:56; SEQ ID NO:57; SEQ ID NO:58; SEQ ID NO:59; SEQ ID
NO:60; SEQ ID NO:61; SEQ ID NO:62; SEQ ID NO:63; SEQ ID NO:64;
SEQ ID NO:65; SEQ ID NO:66; SEQ ID NO:67; SEQ ID NO:68; SEQ ID
NO:70; SEQ ID NO:71; SEQ ID NO:72; SEQ ID NO:73; SEQ ID NO:74;
SEQ ID NO:75; and SEQ ID NO:3114, wherein one or more amino acid
residues in said variant differs from the amino acid sequence, provided that 
said variant differs in no more than 20% of amino acid residues from said 
amino acid sequence. 
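
By way of illustration only, the "no more than 20%" limit recited in (b) and (d) can be checked computationally. The following Python sketch is not part of the disclosed methods; it assumes the two sequences have already been aligned to equal length, that "-" marks an alignment gap, and that the percentage is taken relative to the number of residues in the reference sequence. The function names and example sequences are hypothetical.

```python
def fraction_differing(reference: str, variant: str, gap: str = "-") -> float:
    """Fraction of reference residues that differ in a pre-aligned variant.

    Assumption: both strings come from the same pairwise alignment, so they
    have equal length, and `gap` marks inserted or deleted positions.
    """
    if len(reference) != len(variant):
        raise ValueError("aligned sequences must have equal length")
    # Denominator: residues actually present in the reference sequence.
    ref_len = sum(1 for r in reference if r != gap)
    if ref_len == 0:
        raise ValueError("reference has no residues")
    # Substitutions, insertions, and deletions each count as one difference.
    differing = sum(1 for r, v in zip(reference, variant) if r != v)
    return differing / ref_len


def within_variant_limit(reference: str, variant: str, limit: float = 0.20) -> bool:
    """True when the variant differs in no more than `limit` of the residues."""
    return fraction_differing(reference, variant) <= limit


if __name__ == "__main__":
    ref = "MKTAYIAKQR"  # hypothetical 10-residue reference
    print(within_variant_limit(ref, "MKTAYIAKQL"))  # one substitution
    print(within_variant_limit(ref, "MATGYLAKQL"))  # four substitutions
```

In practice the alignment itself would be produced by a standard sequence alignment tool; the sketch only covers the counting step.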

In one aspect of the invention the SEQ ID NO:39; SEQ ID NO:40;
SEQ ID NO:41; SEQ ID NO:42; SEQ ID NO:43; SEQ ID NO:46; SEQ ID
NO:47; SEQ ID NO:48; SEQ ID NO:49; SEQ ID NO:53; SEQ ID NO:54;
SEQ ID NO:55; SEQ ID NO:56; SEQ ID NO:57; SEQ ID NO:58; SEQ ID
NO:59; SEQ ID NO:60; SEQ ID NO:61; SEQ ID NO:62; SEQ ID NO:63;
SEQ ID NO:64; SEQ ID NO:65; SEQ ID NO:66; SEQ ID NO:67; SEQ ID
NO:68; SEQ ID NO:70; SEQ ID NO:71; SEQ ID NO:72; SEQ ID NO:73;
SEQ ID NO:74; SEQ ID NO:75; and SEQ ID NO:3114. SEQ ID NO:39; SEQ
ID NO:56; SEQ ID NO:59; SEQ ID NO:61; SEQ ID NO:62; SEQ ID NO:65;
and SEQ ID NO:68

[00133] Forms of a protein of the present invention can be recovered from 

culture medium or from host cell lysates. If membrane-bound, it can be released from 
the membrane using a suitable detergent solution (e.g. Triton-X 100) or by enzymatic 
cleavage. Cells employed in expression of polypeptide can be disrupted by various 
physical or chemical means, such as freeze-thaw cycling, sonication, mechanical 
disruption, or cell lysing agents. 

[00134] It can be desired to purify a protein of the present invention from 

recombinant cell proteins or polypeptides. The following procedures are exemplary of 
suitable purification procedures: by fractionation on an ion-exchange column; ethanol 
precipitation; reverse phase HPLC; chromatography on silica or on a cation-exchange 
resin such as DEAE; chromatofocusing; SDS-PAGE; ammonium sulfate
precipitation; gel filtration using, for example, Sephadex G-75; protein A Sepharose
columns to remove contaminants such as IgG; and metal chelating columns to bind
epitope-tagged forms of a protein of the present invention. Additionally, other
purification methods known to those skilled in the art can be used. Various methods 
of protein purification can be employed and such methods are known in the art and 
described for example in Deutscher, Methods in Enzymology, 182 (1990); Scopes, 
Protein Purification: Principles and Practice, Springer-Verlag, New York (1982).
The purification step(s) selected will depend, for example, on the nature of the 
production process used and the particular protein produced. 

Polypeptide Variants 

[00135] In addition to the full-length native sequence polypeptides described
herein, variants with functions similar to those of the polypeptides disclosed can be
prepared. Variants can be prepared by introducing appropriate nucleotide changes into
the DNA, and/or by synthesis of the desired polypeptide. Those skilled in the art will
appreciate that amino acid changes can alter post-translational processing of a protein
of the present invention, such as changing the number or position of glycosylation
sites or altering the membrane anchoring characteristics. Variations in the native full-length
sequence or in various domains of a protein of the present invention
described herein, can be made, for example, using any of the techniques and
guidelines for conservative and non-conservative mutations set forth, for instance, in
U.S. Patent No. 5,364,934. Variations can include the substitution, deletion or
insertion of one or more codons encoding the protein of the present invention that
results in a change in the amino acid sequence of the protein of the present invention
as compared with the native sequence. Optionally, the variation is by substitution of at
least one amino acid with any other amino acid in one or more of the domains of the
polypeptides. Guidance in determining which amino acid residue can be inserted,
substituted or deleted without adversely affecting the desired activity can be found by
comparing the sequence of the polypeptide with that of a homologous known protein
derived from other mammals, and minimizing the number of amino acid sequence
changes made in regions of high homology. Amino acid substitutions can be the result
of replacing one amino acid with another amino acid having similar structural and/or
chemical properties, such as the replacement of a leucine with an isoleucine. Such
substitutions are known as conservative amino acid replacements. Insertions or
deletions are, optionally, in the range of about 1 to 5 amino acids. The variation 
allowed can be determined by systematically making insertions, deletions or 
substitutions of amino acids in the sequence and testing the resulting variants for 
activity exhibited by the full length or mature native sequence. 
[00136] In particular embodiments, conservative substitutions of interest are 
shown in Table 2 under the heading of preferred substitutions. If such conservative 
substitutions result in a change in biological activity, then more substantial changes, 
denominated exemplary substitutions in Table 2, or as further described below in 
reference to amino acid classes, are introduced and the products screened for activity. 
[00137] Substantial modifications in function or immunological identity of the 

polypeptide are accomplished by selecting substitutions that differ significantly in 
their effect on maintaining (a) the structure of the polypeptide backbone in the area of 
the substitution, for example, as a sheet or helical conformation, (b) the charge or 
hydrophobicity of the molecule at the target site, or (c) the bulk of the side chain. 
Naturally occurring residues can be divided into groups based on common side-chain 
properties: (1) hydrophobic: norleucine, met, ala, val, leu, ile; (2) neutral hydrophilic: 
cys, ser, thr; (3) acidic: asp, glu; (4) basic: asn, gln, his, lys, arg; (5) residues that
influence chain orientation: gly, pro; and (6) aromatic: trp, tyr, phe. 
[00138] Non-conservative substitutions will entail exchanging a member of one
of these classes for another class. Such substituted residues also can be introduced 
into the conservative substitution sites or, more preferably, into the remaining (non- 
conserved) sites. 
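
The class-based distinction above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the class names and memberships are copied from the grouping given in the text, while the function names and the abbreviation "nle" for norleucine are hypothetical choices made for this sketch.

```python
# Side-chain classes as grouped in the text (three-letter codes, lower case).
# "nle" for norleucine is an assumed abbreviation used only in this sketch.
SIDE_CHAIN_CLASSES = {
    "hydrophobic": {"nle", "met", "ala", "val", "leu", "ile"},
    "neutral hydrophilic": {"cys", "ser", "thr"},
    "acidic": {"asp", "glu"},
    "basic": {"asn", "gln", "his", "lys", "arg"},
    "chain orientation": {"gly", "pro"},
    "aromatic": {"trp", "tyr", "phe"},
}


def side_chain_class(residue: str) -> str:
    """Return the name of the class that contains the given residue."""
    key = residue.lower()
    for name, members in SIDE_CHAIN_CLASSES.items():
        if key in members:
            return name
    raise ValueError(f"unknown residue: {residue!r}")


def is_non_conservative(original: str, replacement: str) -> bool:
    """True when the substitution exchanges a member of one class for another class."""
    return side_chain_class(original) != side_chain_class(replacement)
```

For example, `is_non_conservative("leu", "ile")` is false because both residues fall in the hydrophobic class, whereas `is_non_conservative("asp", "lys")` is true.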

[00139] The variations can be made using methods known in the art such as
oligonucleotide mediated (site-directed) mutagenesis, alanine scanning, and PCR
mutagenesis. Site-directed mutagenesis (See Carter et al., Nucl. Acids Res., 13:4331
(1986); Zoller et al., Nucl. Acids Res., 10:6487 (1987)), cassette mutagenesis (See
Wells et al., Gene, 34:315 (1985)), restriction selection mutagenesis (See Wells et al.,
Philos. Trans. R. Soc. London SerA, 317:415 (1986)) or any other known techniques
can be performed on the cloned DNA to produce the variant DNA. 
[00140] Scanning amino acid analysis can also be employed to identify one or
more amino acids along a contiguous sequence. Among the preferred scanning amino
acids are relatively small, neutral amino acids. Such amino acids include alanine,
glycine, serine, and cysteine. Alanine is typically a preferred scanning amino acid
among this group because it eliminates the side-chain beyond the beta-carbon and,
thus, is less likely to alter the main-chain conformation of the variant (See 
Cunningham and Wells, Science, 244: 1081-1085 (1989)). Alanine is also typically 
preferred because it is the most common amino acid. Further, it is frequently found in 
both buried and exposed positions (See Creighton, The Proteins, (W.H. Freeman & 
Co., N.Y.); Chothia, J. Mol. Biol., 150:1 (1976)). However, if alanine substitution
does not yield adequate amounts of variant, an isosteric amino acid can be used. 
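
As an illustration of the scanning approach described above, the following Python sketch enumerates single-position alanine variants of a peptide given in one-letter code. The function name, the labeling scheme (for example "K2A"), and the optional window arguments are assumptions made for this example; they are not taken from the cited references.

```python
from typing import List, Optional, Tuple


def alanine_scan(sequence: str, start: int = 0,
                 end: Optional[int] = None) -> List[Tuple[str, str]]:
    """Return (label, variant) pairs, each with one residue replaced by alanine.

    Positions are scanned over sequence[start:end] (0-based, end exclusive).
    Residues that are already alanine are skipped.
    """
    sequence = sequence.upper()
    stop = len(sequence) if end is None else end
    variants = []
    for i in range(start, stop):
        residue = sequence[i]
        if residue == "A":
            continue  # nothing to substitute at this position
        label = f"{residue}{i + 1}A"  # 1-based position in the label
        variants.append((label, sequence[:i] + "A" + sequence[i + 1:]))
    return variants
```

Each variant produced this way would then be expressed and tested for activity as described; the sketch covers only the enumeration step.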
[00141] Fragments of the protein of the present invention can be prepared by 

any of a number of conventional techniques. Desired peptide fragments can also be 
chemically synthesized. An alternative approach involves generating fragments by 
enzymatic digestion, e.g., by treating the protein with an enzyme known to cleave 
proteins at sites defined by particular amino acid residues, or by digesting the DNA 
with suitable restriction enzymes and isolating the desired fragment. Another suitable 
technique involves isolating and amplifying a DNA fragment encoding a desired 
polypeptide fragment by polymerase chain reaction (PCR). Oligonucleotides that
define the desired termini of the DNA fragment are employed as the 5' and 3' primers
in the PCR. Preferably, polypeptide fragments share at least one biological and/or
immunological activity with the native polypeptide. 
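
The way primers define the termini of an amplified fragment can be sketched as follows. This is a minimal illustration, assuming a DNA template written 5' to 3' using only A, C, G and T; the function names and the default primer length of 18 are hypothetical, and no melting-temperature or specificity checks are attempted.

```python
# Translation table for complementing unambiguous DNA bases.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna: str) -> str:
    """Return the reverse complement of a DNA string (A, C, G, T only)."""
    return dna.upper().translate(COMPLEMENT)[::-1]


def terminal_primers(template: str, start: int, end: int, primer_len: int = 18):
    """Return (forward, reverse) primers that define template[start:end].

    Coordinates are 0-based with an exclusive end. The forward primer copies
    the 5' end of the fragment; the reverse primer is the reverse complement
    of the fragment's 3' end. Both are written 5' to 3'.
    """
    fragment = template.upper()[start:end]
    if len(fragment) < primer_len:
        raise ValueError("fragment is shorter than the requested primer length")
    forward = fragment[:primer_len]
    reverse = reverse_complement(fragment[-primer_len:])
    return forward, reverse
```

For instance, `terminal_primers("ATGGCCAAGCTGGGATCC", 0, 12, primer_len=6)` yields the pair `("ATGGCC", "CAGCTT")`.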

Modifications of polypeptides 

[00142] Covalent modifications of a protein of the present invention are
included within the scope of this invention. One type of covalent modification
includes reacting targeted amino acid residues of a polypeptide with an organic
derivatizing agent that is capable of reacting with selected side chains or the N- or
C-terminal residues of a protein of the present invention. Derivatization with
bifunctional agents is useful, for instance, for crosslinking protein to a water-insoluble
support matrix or surface for use in the method for purifying antibodies, and
vice-versa. Commonly used crosslinking agents include, e.g.,
1,1-bis(diazoacetyl)-2-phenylethane, glutaraldehyde, N-hydroxysuccinimide esters, for
example, esters with 4-azidosalicylic acid, homobifunctional imidoesters, including
disuccinimidyl esters such as 3,3'-dithiobis(succinimidylpropionate), bifunctional
maleimides such as bis-N-maleimido-1,8-octane and agents such as
methyl-3-[(p-azidophenyl)dithio]propioimidate.

[00143] Other modifications include deamidation of glutaminyl and
asparaginyl residues to the corresponding glutamyl and aspartyl residues,
respectively, hydroxylation of proline and lysine, phosphorylation of hydroxyl groups
of seryl or threonyl residues, methylation of the α-amino groups of lysine, arginine,
and histidine side chains (See T.E. Creighton, Proteins: Structure and Molecular
Properties, W.H. Freeman & Co., San Francisco, pp. 79-86 (1983)), acetylation of the
N-terminal amine, and amidation of any C-terminal carboxyl group. 
[00144] Another type of covalent modification of the polypeptide included 

within the scope of this invention comprises altering the native glycosylation pattern 
of the polypeptide. "Altering the native glycosylation pattern" is intended for purposes 
herein to mean deleting one or more carbohydrate moieties found in the native 
sequence (either by removing the underlying glycosylation site or by deleting the 
glycosylation by chemical and/or enzymatic means), and/or adding one or more 
glycosylation sites that are not present in the native sequence. In addition, the phrase 
includes qualitative changes in the glycosylation of the native proteins, involving a 
change in the nature and proportions of the various carbohydrate moieties present. 
Addition of glycosylation sites to the polypeptide can be accomplished by altering the 
amino acid sequence. The alteration can be made, for example, by the addition of, or 
substitution by, one or more serine or threonine residues to the native sequence (for
O-linked glycosylation sites). The amino acid sequence can optionally be altered through
changes at the DNA level, particularly by mutating the DNA encoding the 
polypeptide at preselected bases such that codons are generated that will translate into 
the desired amino acids. 

[00145] Another means of increasing the number of carbohydrate moieties on
the polypeptide is by chemical or enzymatic coupling of glycosides to the
polypeptide. Such methods are described in the art, e.g., in WO 87/05330 published
11 September 1987, and in Aplin and Wriston, CRC Crit. Rev. Biochem., pp. 259-306
(1981).

[00146] Removal of carbohydrate moieties present on the polypeptide can be 
accomplished chemically or enzymatically or by mutational substitution of codons 
encoding for amino acid residues that serve as targets for glycosylation. Chemical 
deglycosylation techniques are known in the art and described, for instance, by
Hakimuddin et al., Arch. Biochem. Biophys., 259:52 (1987) and by Edge et al., Anal.
Biochem., 118:131 (1981). Enzymatic cleavage of carbohydrate moieties on
polypeptides can be achieved by the use of a variety of endo- and exo-glycosidases as
described by Thotakura et al., Meth. Enzymol., 138:350 (1987).
[00147] Another type of covalent modification of a protein or antibody of the 
present invention comprises linking the polypeptide or antibody to one of a variety of 
non-proteinaceous polymers, e.g., polyethylene glycol (PEG), polypropylene glycol, 
or polyoxyalkylenes, in the manner set forth in U.S. Patent Nos. 4,640,835; 
4,496,689; 4,301,144; 4,670,417; 4,791,192 or 4,179,337. 

[00148] Functional groups capable of reacting with either the amino terminal
α-amino group or ε-amino groups of lysines found on the polypeptide of the present
invention, agonist, antagonist, or antibody include: carbonates such as the
p-nitrophenyl or succinimidyl; carbonyl imidazole; azlactones; cyclic imide thiones;
isocyanates or isothiocyanates; tresyl chloride (EP 714 402, EP 439 508); and
aldehydes. Functional groups capable of reacting with carboxylic acid groups,
reactive carbonyl groups and oxidized carbohydrate moieties on the polypeptide of
the present invention, agonist, antagonist, or antibody include: primary amines; and
hydrazine and hydrazide functional groups such as the acyl hydrazides, carbazates,
semicarbamates, thiocarbazates, etc. Mercapto groups, if available on the polypeptide
of the present invention, agonist, antagonist, or antibody, can also be used as
attachment sites for suitably activated polymers with reactive groups such as thiols,
maleimides, sulfones, and phenyl glyoxals; see, for example, U.S. Pat. No. 5,093,531,
the disclosure of which is hereby incorporated by reference. Other nucleophiles 
capable of reacting with an electrophilic center include, but are not limited to, for 
example, hydroxyl, amino, carboxyl, thiol, active methylene and the like. 
[00149] In one preferred embodiment of the invention, secondary amine or
amide linkages are formed using the polypeptide of the present invention, agonist,
antagonist, or antibody N-terminal amino groups or ε-amino groups of lysine and the
activated PEG. In another preferred aspect of the invention, a secondary amine
linkage is formed between the N-terminal primary amino group of the polypeptide of
the present invention, agonist, antagonist, or antibody and single or branched chain
PEG aldehyde by reduction with a suitable reducing agent such as NaCNBH3, NaBH3,
pyridine borane, etc., as described in Chamow et al., Bioconjugate Chem. 5:133-140
(1994) and US Pat. No. 5,824,784.

[00150] In another preferred embodiment of the invention, polymers activated
with amide-forming linkers such as succinimidyl esters, cyclic imide thiones, or the
like are used to effect the linkage between the polypeptide of the present invention,
agonist, antagonist, or antibody and polymer; see, for example, U.S. Pat. No.
5,349,001; U.S. Pat. No. 5,405,877; and Greenwald et al., Crit. Rev. Ther. Drug
Carrier Syst. 17:101-161, 2000, which are incorporated herein by reference. One
preferred activated poly(ethylene glycol), which may be bound to the free amino
groups of the polypeptide of the present invention, agonist, antagonist, or antibody,
includes single or branched chain N-hydroxysuccinylimide poly(ethylene glycol), which
may be prepared by activating succinic acid esters of poly(ethylene glycol) with
N-hydroxysuccinylimide.

[00151] Other preferred embodiments of the invention include using other
activated polymers to form covalent linkages of the polymer with the polypeptide of
the present invention, agonist, antagonist, or antibody via ε-amino or other groups.
For example, isocyanate or isothiocyanate forms of terminally activated polymers can
be used to form urea or thiourea-based linkages with the lysine amino groups. 
[00152] In another preferred aspect of the invention, carbamate (urethane) 

linkages are formed with protein amino groups as described in U.S. Pat. Nos. 
5,122,614, 5,324,844, and 5,612,640, which are hereby incorporated by reference. 
Examples include N-succinimidyl carbonate, para-nitrophenyl carbonate, and 
carbonyl imidazole activated polymers. In another preferred embodiment of this 
invention, a benzotriazole carbonate derivative of PEG is linked to amino groups on 
the polypeptide of the present invention, agonist, antagonist, or antibody. 
[00153] The protein of the present invention can also be modified in a way to 

form a chimeric molecule comprising a protein of the present invention fused to 
another, heterologous polypeptide, or amino acid sequence. 

[00154] In one embodiment, such a chimeric molecule comprises a fusion of a 

protein of the present invention with a tag polypeptide, which provides an epitope to 
which an anti-tag antibody can selectively bind. The epitope tag is generally placed at 
the amino- or carboxyl- terminus of the protein. The presence of such epitope-tagged 
forms of a protein of the present invention can be detected using an antibody against 
the tag polypeptide. Also, provision of the epitope tag enables the protein to be readily
purified by affinity purification using an anti-tag antibody or another type of affinity
matrix that binds to the epitope tag. Various tag polypeptides and their respective
antibodies are well known in the art. Examples include poly-histidine (poly-His) or
poly-histidine-glycine (poly-His-Gly) tags; the flu HA tag polypeptide and its antibody
12CA5 (Field et al., Mol. Cell. Biol., 8:2159-2165 (1988)); the c-myc tag and the 8F9,
3C7, 6E10, G4, B7 and 9E10 antibodies thereto (Evan et al., Molecular and Cellular
Biology, 5:3610-3616 (1985)); and the Herpes Simplex virus glycoprotein D (gD) tag
and its antibody (Paborsky et al., Protein Engineering, 3(6):547-553 (1990)). Other
tag polypeptides include the Flag-peptide (Hopp et al., BioTechnology, 6:1204-1210
(1988)); the KT3 epitope peptide (Martin et al., Science, 255:192-194 (1992)); an
α-tubulin epitope peptide (Skinner et al., J. Biol. Chem., 266:15163-15166 (1991)); and
the T7 gene 10 protein peptide tag (Lutz-Freyermuth et al., Proc. Natl. Acad. Sci.
USA, 87:6393-6397 (1990)).

[00155] In an alternative embodiment, the chimeric molecule can comprise a 

fusion of the polypeptide with an immunoglobulin or a particular region of an 
immunoglobulin. For a bivalent form of the chimeric molecule (also referred to as an 
"immunoadhesin"), such a fusion could be to the Fc region of an IgG molecule. The 
Ig fusions preferably include the substitution of a soluble (transmembrane domain 
deleted or inactivated) form of a polypeptide in place of at least one variable region 
within an Ig molecule. In a particularly preferred embodiment, the immunoglobulin 
fusion includes the hinge, CH2 and CH3, or the hinge, CH1, CH2 and CH3 regions of
an IgG1 molecule. For the production of immunoglobulin fusions see also US Patent
No. 5,428,130 issued June 27, 1995. 

[00156] In another embodiment, the chimeric molecule includes a fusion of a 

protein of the present invention with a signal peptide to allow or enhance secretion of 
the peptide or even to change its localization within the host cell. The signal sequence 
is generally placed at the amino- or carboxyl- terminus of a protein of the present 
invention, more usually the N-terminus when secretion or membrane localization is 
desired. Such fusions are typically intermediate products, since the signal peptide is 
usually specifically cleaved by enzymes of the host cell. Provision of a signal peptide 
enables the protein to be readily purified following its secretion to the culture 
medium. Various signal polypeptides, which allow secretion or targeting to 
compartments within the cell, are well known in the art and are available for use with
numerous host cells, including yeast and mammalian cells. 
[00157] Polynucleotides of the present invention can also be used for gene 

therapy. Gene therapy refers to therapy that is performed by the administration of a 
specific nucleic acid to a subject. Delivery of the therapeutic nucleic acid into a 
mammalian subject may be either direct (i.e., the patient is directly exposed to the
nucleic acid or nucleic acid-containing vector) or indirect (i.e., cells are first
transformed with the nucleic acid in vitro, then transplanted into the patient). These
two approaches are known, respectively, as in vivo and ex vivo gene therapy.
Polynucleotides of the invention may also be administered by other known methods 
for introduction of nucleic acid into a cell or organism (including, without limitation, 
in the form of viral vectors or naked DNA). Any of the methodologies relating to gene 
therapy available within the art may be used in the practice of the present invention. 
See, e.g., Gene Therapy of Cancer: Translational Approaches from Preclinical Studies
to Clinical Implementation, E. C. Lattime & S. L. Gerson, eds., Academic Press, 2002.
[00158] Cells may also be cultured ex vivo in the presence of therapeutic 

agents or proteins of the present invention in order to proliferate or to produce a 
desired effect on or activity in such cells. Treated cells can then be introduced in vivo 
for therapeutic purposes. 

Rational Drug Design 

[00159] The goal of rational drug design is to produce structural analogs of
biologically active polypeptides of interest or of small molecules with which they
interact, e.g., agonists, antagonists, or inhibitors. Any of these examples can be used
to fashion drugs which are more active or stable forms of the polypeptide or which
enhance or interfere with the function of the polypeptide in vivo (cf. Hodgson,
Bio/Technology, 9:19-21 (1991)).

[00160] In one approach, the three-dimensional structure of the polypeptide, or 

of a polypeptide-inhibitor complex, is determined by x-ray crystallography, by 
computer modeling or, most typically, by a combination of the two approaches. Both 
the shape and charges of the polypeptide must be ascertained to elucidate the structure 
and to determine active site(s) of the molecule. Less often, useful information 
regarding the structure of the polypeptide can be gained by modeling based on the 
structure of homologous proteins. In both cases, relevant structural information is
used to design analogous polypeptide-like molecules or to identify efficient inhibitors.
Useful examples of rational drug design can include molecules which have improved
activity or stability as shown by Braxton and Wells, Biochemistry, 31:7796-7801
(1992) or which act as inhibitors, agonists, or antagonists of native peptides as shown
by Athauda et al., J. Biochem., 113:742-746 (1993).

[00161] It is also possible to isolate a target-specific antibody, selected by 

functional assay, as described above, and then to solve its crystal structure. This 
approach, in principle, yields a pharmacore upon which subsequent drug design can 
be based. It is possible to bypass protein crystallography altogether by generating 
anti-idiotypic antibodies (anti-ids) to a functional, pharmacologically active antibody. 
As a mirror image of a mirror image, the binding site of the anti-ids would be 
expected to be an analog of the original receptor. The anti-id could then be used to 
identify and isolate peptides from banks of chemically or biologically produced 
peptides. The isolated peptides would then act as the pharmacore. 
[00162] By virtue of the present invention, sufficient amounts of the 

polypeptide can be made available to perform such analytical studies as X-ray 
crystallography. In addition, knowledge of the polypeptide amino acid sequence 
provided herein will provide guidance to those employing computer modeling 
techniques in place of or in addition to x-ray crystallography. 

Nucleic Acid 

[00163] The present invention further provides isolated nucleic acid molecules 

that encode a peptide or protein of the present invention. Nucleic acid molecules will 
consist of, consist essentially of, or comprise a nucleotide sequence that encodes one 
of the peptides of the present invention, an allelic variant thereof, or an ortholog or 
paralog thereof. 

[00164] As used herein, an "isolated" nucleic acid molecule is one that is 

separated from other nucleic acid present in the natural source of the nucleic acid. 
Preferably, an "isolated" nucleic acid is free of sequences which naturally flank the 
nucleic acid (i.e., sequences located at the 5' and 3' ends of the nucleic acid) in the
genomic DNA of the organism from which the nucleic acid is derived. The important 
point is that the nucleic acid is isolated from remote and unimportant flanking 
sequences such that it can be subjected to the specific manipulations described herein
such as recombinant expression, preparation of probes and primers, and other uses 
specific to the nucleic acid sequences. 

[00165] Moreover, an "isolated" nucleic acid molecule, such as a 

transcript/cDNA molecule, can be substantially free of other cellular material, or 
culture medium when produced by recombinant techniques, or chemical precursors or 
other chemicals when chemically synthesized. However, the nucleic acid molecule 
can be fused to other coding or regulatory sequences and still be considered isolated. 
[00166] For example, recombinant DNA molecules contained in a vector are 

considered isolated. Further examples of isolated DNA molecules include 
recombinant DNA molecules maintained in heterologous host cells or purified 
(partially or substantially) DNA molecules in solution. Isolated RNA molecules 
include in vivo or in vitro RNA transcripts of the isolated DNA molecules of the 
present invention. Isolated nucleic acid molecules according to the present invention 
further include such molecules produced synthetically. 

[00167] Accordingly, the present invention provides nucleic acid molecules
that comprise the nucleotide sequence shown in SEQ ID NOs:1-38, or any nucleic
acid molecule that encodes the protein provided in SEQ ID NOs:39-76. A nucleic
acid molecule comprises a nucleotide sequence when the nucleotide sequence
is at least part of the final nucleotide sequence of the nucleic acid molecule.
[00168] Accordingly, the present invention provides nucleic acid molecules
that comprise the nucleotide sequence shown in SEQ ID NOs:1-11, SEQ ID NOs:13-38,
and SEQ ID NO:3113, or any nucleic acid molecule that encodes the protein
provided in SEQ ID NOs:39-49, SEQ ID NOs:51-76, and SEQ ID NO:3114. A
nucleic acid molecule consists of a nucleotide sequence when the nucleotide sequence 
is the complete nucleotide sequence of the nucleic acid molecule. 
[00169] Accordingly, the present invention provides nucleic acid molecules
that consist of the nucleotide sequence shown in SEQ ID NOs:1-11, SEQ ID NOs:13-38,
and SEQ ID NO:3113, or any nucleic acid molecule that encodes the protein
provided in SEQ ID NOs:39-49, SEQ ID NOs:51-76, and SEQ ID NO:3114. A
nucleic acid molecule consists of a nucleotide sequence when the nucleotide sequence 
is the complete nucleotide sequence of the nucleic acid molecule. 
[00170] The present invention further provides nucleic acid molecules that
consist essentially of the nucleotide sequence shown in SEQ ID NOs:1-11, SEQ ID
NOs:13-38, and SEQ ID NO:3113, or any nucleic acid molecule that encodes the
protein provided in SEQ ID NOs:39-49, SEQ ID NOs:51-76, and SEQ ID NO:3114.
A nucleic acid molecule consists essentially of a nucleotide sequence when such a 
nucleotide sequence is present with only a few additional nucleic acid residues in the 
final nucleic acid molecule. 
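
The three relationships used in the preceding paragraphs ("comprise", "consist of", and "consist essentially of") can be contrasted with a short Python sketch. It is only an illustration and not a legal construction of the terms: the function names are hypothetical, and because the text says only "a few additional nucleic acid residues", the numeric limit `max_extra` is an assumed, adjustable parameter.

```python
def comprises(molecule: str, reference: str) -> bool:
    """The reference sequence is at least part of the molecule's sequence."""
    return reference.upper() in molecule.upper()


def consists_of(molecule: str, reference: str) -> bool:
    """The reference sequence is the complete sequence of the molecule."""
    return molecule.upper() == reference.upper()


def consists_essentially_of(molecule: str, reference: str,
                            max_extra: int = 10) -> bool:
    """The reference is present with only a few additional residues.

    max_extra is an assumed cutoff for "a few"; the text gives no number.
    """
    mol, ref = molecule.upper(), reference.upper()
    return ref in mol and len(mol) - len(ref) <= max_extra
```

With the reference `"ATGGCC"`, the molecule `"TTATGGCCAA"` comprises the reference, does not consist of it, and consists essentially of it only if `max_extra` is at least 4.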

[00171] The isolated nucleic acid molecules can encode the mature protein plus 
additional amino or carboxyl-terminal amino acids, or amino acids interior to the 
mature peptide (when the mature form has more than one peptide chain, for instance).
Such sequences may play a role in processing of a protein from precursor to a mature 
form, facilitate protein trafficking, prolong or shorten protein half-life or facilitate 
manipulation of a protein for assay or production, among other things. As generally is 
the case in situ, the additional amino acids may be processed away from the mature 
protein by cellular enzymes. 

[00172] As mentioned above, the isolated nucleic acid molecules include, but
are not limited to, the sequence encoding the peptide alone, the sequence encoding the
mature peptide and additional coding sequences, such as a leader or secretory
sequence (e.g., a pre-pro or pro-protein sequence), the sequence encoding the mature
peptide, with or without the additional coding sequences, plus additional non-coding
sequences, for example introns and non-coding 5' and 3' sequences such as transcribed
but non-translated sequences that play a role in transcription, mRNA processing 
(including splicing and polyadenylation signals), ribosome binding and stability of 
mRNA. In addition, the nucleic acid molecule may be fused to a marker sequence 
encoding, for example, a peptide that facilitates purification. 
[00173] Isolated nucleic acid molecules can be in the form of RNA, such as
mRNA, or in the form of DNA, including cDNA and genomic DNA obtained by cloning
or produced by chemical synthetic techniques or by a combination thereof. The 
nucleic acid, especially DNA, can be double-stranded or single-stranded. Single- 
stranded nucleic acid can be the coding strand (sense strand) or the non-coding strand 
(anti-sense strand). 

[00174] The invention further provides nucleic acid molecules that encode 
fragments of the peptides of the present invention as well as nucleic acid molecules 
that encode obvious variants of the proteins of the present invention that are described
above. Such nucleic acid molecules may be naturally occurring, such as allelic 
variants (same locus), paralogs (different locus), and orthologs (different organism), 
or may be constructed by recombinant DNA methods or by chemical synthesis. Such 
non-naturally occurring variants may be made by mutagenesis techniques, including 
those applied to nucleic acid molecules, cells, or organisms. Accordingly, as 
discussed above, the variants can contain nucleotide substitutions, deletions, 
inversions and insertions. Variation can occur in either or both the coding and non- 
coding regions. The variations can produce both conservative and non-conservative 
amino acid substitutions.
[00175] A fragment comprises a contiguous nucleotide sequence of
12 or more nucleotides. Further, a fragment could be at least 30, 40, 50, 100, 250 or
500 nucleotides in length depending on the length of the original nucleotide sequence. 
The length of the fragment will be based on its intended use. For example, the 
fragment can encode epitope bearing regions of the peptide, or can be useful as DNA 
probes and primers. Such fragments can be isolated using the known nucleotide 
sequence to synthesize an oligonucleotide probe. A labeled probe can then be used to 
screen a cDNA library, genomic DNA library, or mRNA to isolate nucleic acid 
corresponding to the coding region. Further, primers can be used in PCR reactions to 
clone specific regions of a gene.
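
By way of illustration only, contiguous fragments of a chosen length can be enumerated from a known nucleotide sequence as in the Python sketch below. The function name and the sliding-window approach are assumptions for this example; the minimum length of 12 follows the text, and no probe-quality criteria are applied.

```python
from typing import List

MIN_FRAGMENT_LENGTH = 12  # lower bound stated in the text


def contiguous_fragments(sequence: str,
                         length: int = MIN_FRAGMENT_LENGTH) -> List[str]:
    """Return every contiguous fragment of the given length, in 5' to 3' order.

    An empty list is returned when the sequence is shorter than the length.
    """
    if length < MIN_FRAGMENT_LENGTH:
        raise ValueError("fragments are defined here as 12 or more nucleotides")
    seq = sequence.upper()
    return [seq[i:i + length] for i in range(len(seq) - length + 1)]
```

A 14-nucleotide sequence therefore yields three overlapping 12-nucleotide fragments, any of which could serve as a candidate probe or primer depending on the intended use.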

[00176] A probe/primer typically comprises a substantially purified
oligonucleotide or oligonucleotide pair. The oligonucleotide typically comprises a
region of nucleotide sequence that hybridizes under stringent conditions to at least 
about 12, 20, 25, 40, 50 or more consecutive nucleotides. 

[00177] Orthologs, homologs, and allelic variants can be identified using 

methods well known in the art. These variants comprise a nucleotide sequence 
encoding a peptide that is typically 60-70%, 70-80%, 80-90%, and more typically at 
least about 90-95% or more homologous to the nucleotide sequence. Such nucleic
acid molecules can readily be identified as being able to hybridize under moderate to
stringent conditions. Allelic variants can readily be determined by genetic locus of
the encoding gene.
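
One simple way to put a number on the percentages mentioned above is percent identity over an existing alignment, sketched below in Python. This rests on assumptions: the text speaks of "homologous" sequences without fixing a metric, so identity over aligned, gap-free columns is used here as a stand-in, and the function name and gap character are hypothetical. The two inputs must already be aligned to equal length.

```python
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percent of compared columns that match in two pre-aligned sequences.

    Columns where either sequence has a gap are excluded from the comparison.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("sequences must be aligned to the same length")
    compared = 0
    matches = 0
    for x, y in zip(aligned_a.upper(), aligned_b.upper()):
        if x == gap or y == gap:
            continue  # skip gapped columns
        compared += 1
        if x == y:
            matches += 1
    if compared == 0:
        raise ValueError("no gap-free columns to compare")
    return 100.0 * matches / compared
```

For example, two aligned 10-nucleotide sequences that differ at a single position score 90.0.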

[00178] As used herein, the term "hybridizes under stringent conditions" is 

intended to describe conditions for hybridization and washing under which nucleotide 
sequences encoding a peptide at least 60-70% homologous to each other typically
remain hybridized to each other. The conditions can be such that sequences at least 
about 60%, at least about 70%, at least about 80%, at least about 90%, at least about 
95%, at least about 98%, or at least about 99% or more homologous to each other 
typically remain hybridized to each other. Such stringent conditions are known to 
those skilled in the art and can be found in Current Protocols in Molecular Biology, 
John Wiley & Sons, N.Y. (1989), 6.3.1-6.3.6. One example of stringent hybridization
conditions is hybridization in 6x SSC (sodium chloride/sodium citrate) at about
45°C, followed by one or more washes in 0.2x SSC, 0.1% SDS at 50-65°C. Examples
of moderate to low stringency hybridization conditions are well known in the art. 
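
For orientation, the salt concentrations implied by the SSC strengths in the example above can be worked out with the short Python sketch below. It assumes the commonly used composition of 1x SSC (0.15 M sodium chloride, 0.015 M sodium citrate), which is not stated in the text; the function and constant names are hypothetical.

```python
from typing import Dict

# Assumed standard composition of 1x SSC, in mol/L.
SSC_1X = {"NaCl": 0.15, "sodium citrate": 0.015}


def ssc_molarity(strength: float) -> Dict[str, float]:
    """Return the molar concentration of each salt at the given SSC strength."""
    return {salt: round(strength * molar, 6) for salt, molar in SSC_1X.items()}


hybridization = ssc_molarity(6)   # 6x SSC used for hybridization
wash = ssc_molarity(0.2)          # 0.2x SSC used for the washes
```

Under that assumption, 6x SSC corresponds to 0.9 M sodium chloride and 0.09 M sodium citrate, and the 0.2x SSC wash to 0.03 M and 0.003 M, respectively; the lower salt in the wash is what makes it the more stringent step.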

Nucleic Acid Molecule Uses 

[00179] The nucleic acid molecules of the present invention are useful for 

probes, primers, chemical intermediates, and in biological assays. The nucleic acid 
molecules are useful as a hybridization probe for messenger RNA, transcript/cDNA 
and genomic DNA to isolate full-length cDNA and genomic clones encoding the 
peptides shown in SEQ ID NOs:39-49, SEQ ID NOs:51-76, and SEQ ID NO:3114
and to isolate cDNA and genomic clones that correspond to variants (alleles,
orthologs, etc.) producing the same or related peptides shown in SEQ ID NOs:39-49,
SEQ ID NOs:51-76, and SEQ ID NO:3114.

[00180] The probe can correspond to any sequence along the entire length of 

the nucleic acid molecules. Accordingly, it could be derived from 5' noncoding 
regions, the coding region, and 3' noncoding regions.

[00181] The nucleic acid molecules are also useful as primers for PCR to
amplify any given region of a nucleic acid molecule and are useful to synthesize 
antisense molecules of desired length and sequence. 
[00182] The nucleic acid molecules are also useful for constructing 

recombinant vectors. Such vectors include expression vectors that express a portion 
of, or all of, the peptide sequences. Vectors also include insertion vectors, used to 
integrate into another nucleic acid molecule sequence, such as into the cellular 
genome, to alter in situ expression of a gene and/or gene product. For example, an 
endogenous coding sequence can be replaced via homologous recombination with all
or part of the coding region containing one or more specifically introduced mutations. 
[00183] The nucleic acid molecules are also useful for expressing antigenic 
portions of the proteins. 

[00184] The nucleic acid molecules are also useful as probes for determining 

the chromosomal positions of the nucleic acid molecules by means of in situ 
hybridization methods. 

[00185] The nucleic acid molecules are also useful in making vectors 
containing the gene regulatory regions of the nucleic acid molecules of the present 
invention. 

[00186] The nucleic acid molecules are also useful for designing ribozymes 

corresponding to all, or a part, of the mRNA produced from the nucleic acid 
molecules described herein. 

[00187] The nucleic acid molecules are also useful for making vectors that
express part, or all, of the peptides.

[00188] The nucleic acid molecules are also useful for constructing host cells 
expressing a part, or all, of the nucleic acid molecules and peptides. 
[00189] The nucleic acid molecules are also useful for constructing transgenic 

animals expressing all, or a part, of the nucleic acid molecules and peptides. 
[00190] The nucleic acid molecules are also useful as hybridization probes for
determining the presence, level, form and distribution of nucleic acid expression.
Accordingly, the probes can be used to detect the presence of, or to determine levels 
of, a specific nucleic acid molecule in cells, tissues, and in organisms. The nucleic 
acid whose level is determined can be DNA or RNA. Accordingly, probes 
corresponding to the peptides described herein can be used to assess expression and/or 
gene copy number in a given cell, tissue, or organism. These uses are relevant for 
diagnosis of disorders involving an increase or decrease in protein expression relative 
to normal results. 

[00191] Probes can be used as a part of a diagnostic test kit for identifying cells
or tissues that express a protein.

Nucleic Acid Expression Assays

[00192] Nucleic acid expression assays are useful for drug screening to
identify compounds that modulate nucleic acid expression.

[00193] The compounds can be used to treat a disorder associated with nucleic
acid expression of the gene, particularly biological and pathological processes that are
mediated by the gene in cells and tissues that express it. The method typically
includes assaying the ability of the compound to modulate the expression of the 
nucleic acid and thus identifying a compound that can be used to treat a disorder 
characterized by undesired nucleic acid expression. The assays can be performed in 
cell-based and cell-free systems. Cell-based assays include cells naturally expressing 
the nucleic acid or recombinant cells genetically engineered to express specific 
nucleic acid sequences. 

[00194] The assay for nucleic acid expression can involve direct assay of
nucleic acid levels, such as mRNA levels. In this embodiment the regulatory regions
of these genes can be operably linked to a reporter gene such as luciferase. 
[00195] Thus, modulators of gene expression can be identified in a method 
wherein a cell is contacted with a candidate compound and the expression of mRNA 
determined. The level of expression of mRNA in the presence of the candidate 
compound is compared to the level of expression of mRNA in the absence of the 
candidate compound. The candidate compound can then be identified as a modulator 
of nucleic acid expression based on this comparison and be used, for example, to treat
a disorder characterized by aberrant nucleic acid expression. When expression of 
mRNA is statistically significantly greater in the presence of the candidate compound 
than in its absence, the candidate compound is identified as a stimulator of nucleic 
acid expression. When nucleic acid expression is statistically significantly less in the 
presence of the candidate compound than in its absence, the candidate compound is 
identified as an inhibitor of nucleic acid expression. 
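
The comparison described above can be sketched in code. The following is a minimal illustration, not a prescribed protocol: it assumes replicate mRNA measurements are available as plain numbers, and it uses an exact two-sided permutation test on the difference of means as one possible way to decide statistical significance. The function names, the significance threshold `alpha`, and the choice of test are all assumptions made for the example.

```python
from itertools import combinations
from statistics import mean


def permutation_p_value(treated, control):
    """Exact two-sided permutation p-value for the difference in group means."""
    pooled = list(treated) + list(control)
    n_treated = len(treated)
    observed = abs(mean(treated) - mean(control))
    indices = range(len(pooled))
    extreme = 0
    total = 0
    # Enumerate every way of relabeling the pooled values as "treated".
    for chosen in combinations(indices, n_treated):
        chosen_set = set(chosen)
        group_a = [pooled[i] for i in chosen]
        group_b = [pooled[i] for i in indices if i not in chosen_set]
        total += 1
        # Small tolerance guards against floating-point ties.
        if abs(mean(group_a) - mean(group_b)) >= observed - 1e-12:
            extreme += 1
    return extreme / total


def classify_modulator(treated, control, alpha=0.05):
    """Label a candidate compound from mRNA levels with and without it."""
    if permutation_p_value(treated, control) >= alpha:
        return "no significant effect"
    if mean(treated) > mean(control):
        return "stimulator"
    return "inhibitor"
```

With four replicates per condition the smallest attainable p-value is 2/70, so a threshold of 0.05 can be met only when the two groups are completely separated; more replicates would be needed for finer discrimination.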

[00196] The invention further provides methods of treatment, with the nucleic 
acid as a target, using a compound identified through drug screening as a gene 
modulator to modulate nucleic acid expression in cells and tissues that express the 
nucleic acid. Modulation includes both up-regulation (i.e., activation or agonization)
and down-regulation (i.e., suppression or antagonization) of nucleic acid expression.
[00197] Alternatively, a modulator for nucleic acid expression can be a small
molecule or drug identified using the screening assays described herein as long as the
drug or small molecule inhibits the nucleic acid expression in the cells and tissues that 
express the protein. 

[00198] The nucleic acid molecules are also useful for monitoring the 
effectiveness of modulating compounds on the expression or activity of the gene in 
clinical trials or in a treatment regimen. Thus, the gene expression pattern can serve as 
a barometer for the continuing effectiveness of treatment with the compound, 
particularly with compounds to which a patient can develop resistance. The gene 
expression pattern can also serve as a marker indicative of a physiological response of 
the affected cells to the compound. Accordingly, such monitoring would allow either 
increased administration of the compound or the administration of alternative 
compounds to which the patient has not become resistant. Similarly, if the level of 
nucleic acid expression falls below a desirable level, administration of the compound 
could be commensurately decreased. 

Nucleic acid diagnostics 

[00199] The nucleic acid molecules are also useful in diagnostic assays for
qualitative changes in nucleic acid expression, and particularly in qualitative changes
that lead to pathology. The nucleic acid molecules can be used to detect mutations in 
genes and gene expression products such as mRNA. The nucleic acid molecules can 
be used as hybridization probes to detect naturally occurring genetic mutations in the 
gene and thereby to determine whether a subject with the mutation is at risk for a 
disorder caused by the mutation. Mutations include deletion, addition, or substitution
of one or more nucleotides in the gene; chromosomal rearrangement, such as
inversion or transposition; modification of genomic DNA, such as aberrant
methylation patterns; or changes in gene copy number, such as amplification.
Detection of a mutated form of the gene associated with a dysfunction provides a 
diagnostic tool for an active disease or susceptibility to disease when the disease 
results from overexpression, underexpression, or altered expression of a protein. 
[00200] Individuals carrying mutations in the gene can be detected at the 
nucleic acid level by a variety of techniques. Genomic DNA can be analyzed directly 
or can be amplified by using PCR prior to analysis. RNA or cDNA can be used in the 
same way. In some uses, detection of the mutation involves the use of a probe/primer
in a polymerase chain reaction (PCR) (see, e.g., U.S. Pat. Nos. 4,683,195 and
4,683,202), such as anchor PCR or RACE PCR, or, alternatively, in a ligation chain
reaction (LCR) (see, e.g., Landegran et al., Science 241:1077-1080 (1988); and
Nakazawa et al., PNAS 91:360-364 (1994)), the latter of which can be particularly
useful for detecting point mutations in the gene (see Abravaya et al., Nucleic Acids
Res. 23:675-682 (1995)). This method can include the steps of collecting a sample of
cells from a patient, isolating nucleic acid (e.g., genomic, mRNA or both) from the 
cells of the sample, contacting the nucleic acid sample with one or more primers 
which specifically hybridize to a gene under conditions such that hybridization and 
amplification of the gene (if present) occurs, and detecting the presence or absence of 
an amplification product, or detecting the size of the amplification product and 
comparing the length to a control sample. Deletions and insertions can be detected by 
a change in size of the amplified product compared to the normal genotype. Point 
mutations can be identified by hybridizing amplified DNA to normal RNA or 
antisense DNA sequences. 
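
The size comparison described in the preceding paragraph can be illustrated with a short in silico sketch. It assumes exact primer matches on a single template strand and ignores everything a real amplification depends on (annealing conditions, mismatches, multiple binding sites); the function names and labels are hypothetical and chosen only for this example.

```python
# Map each DNA base to its complement.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def amplicon_length(template, forward_primer, reverse_primer):
    """Length of the product bounded by the two primers, or None if absent.

    Both primers are written 5'->3'. The reverse primer anneals to the
    opposite strand, so its reverse complement is searched on the template.
    """
    template = template.upper()
    start = template.find(forward_primer.upper())
    if start == -1:
        return None
    reverse_site = reverse_complement(reverse_primer)
    end = template.find(reverse_site, start + len(forward_primer))
    if end == -1:
        return None
    return end + len(reverse_site) - start


def classify_by_size(sample_length, control_length):
    """Compare a sample amplicon length with the normal-genotype control."""
    if sample_length is None:
        return "no amplification product"
    if sample_length < control_length:
        return "deletion"
    if sample_length > control_length:
        return "insertion"
    return "same size as control"
```

A point mutation leaves the product length unchanged, so this comparison would report "same size as control" for it; that case calls for the hybridization-based detection mentioned above.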

[00201] Alternatively, mutations in a gene can be directly identified, for
example, by alterations in restriction enzyme digestion patterns determined by gel
electrophoresis. 
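
As a rough illustration of how a mutation alters a digestion pattern, the sketch below computes fragment lengths for a linear sequence cut at every occurrence of a recognition site. EcoRI (recognition site GAATTC, cut after the first G) is used as the default purely as an example; the function name and the toy sequences are assumptions, and only the top strand of a linear molecule is considered.

```python
def digest(sequence, site="GAATTC", cut_offset=1):
    """Return fragment lengths after cutting a linear sequence at each site.

    cut_offset is the number of bases into the recognition site at which
    the top strand is cut (1 for EcoRI, which cuts G^AATTC).
    """
    sequence = sequence.upper()
    cut_positions = []
    position = sequence.find(site)
    while position != -1:
        cut_positions.append(position + cut_offset)
        position = sequence.find(site, position + 1)
    # Fragment lengths are the gaps between consecutive boundaries.
    boundaries = [0] + cut_positions + [len(sequence)]
    return [right - left for left, right in zip(boundaries, boundaries[1:])]


# A single base change that destroys one site merges two fragments into one.
wild_type = "AAAAGAATTCCCCCCCGAATTCTT"
mutant = "AAAAGAATTCCCCCCCGAGTTCTT"
wild_type_pattern = digest(wild_type)   # three fragments
mutant_pattern = digest(mutant)         # two fragments
```

On a gel, the loss of the second site would appear as two wild-type bands replaced by a single larger band.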

[00202] Further, sequence-specific ribozymes (U.S. Pat. No. 5,498,531) can be
used to score for the presence of specific mutations by development or loss of a
ribozyme cleavage site. Perfectly matched sequences can be distinguished from 
mismatched sequences by nuclease cleavage digestion assays or by differences in 
melting temperature. 

[00203] Sequence changes at specific locations can also be assessed by
nuclease protection assays such as RNase and S1 protection or the chemical cleavage
method. Furthermore, sequence differences between a mutant gene and a wild-type 
gene can be determined by direct DNA sequencing. A variety of automated 
sequencing procedures can be utilized when performing the diagnostic assays (Naeve, 
G. W., (1995) Biotechniques 19:448), including sequencing by mass spectrometry 
(see, e.g., PCT International Publication No. WO 94/16101; Cohen et al., Adv.
Chromatogr. 36:127-162 (1996); and Griffin et al., Appl. Biochem. Biotechnol.
38:147-159 (1993)).
[00204] Other methods for detecting mutations in the gene include methods in
which protection from cleavage agents is used to detect mismatched bases in
RNA/RNA or RNA/DNA duplexes (Myers et al., Science 230:1242 (1985); Cotton et
al., PNAS 85:4397 (1988); Saleeba et al., Meth. Enzymol. 217:286-295 (1992)),
electrophoretic mobility of mutant and wild type nucleic acid is compared (Orita et
al., PNAS 86:2766 (1989); Cotton et al., Mutat. Res. 285:125-144 (1993); and Hayashi
et al., Genet. Anal. Tech. Appl. 9:73-79 (1992)), and movement of mutant or wild-type
fragments in polyacrylamide gels containing a gradient of denaturant is assayed
using denaturing gradient gel electrophoresis (Myers et al., Nature 313:495 (1985)).
Examples of other techniques for detecting point mutations include selective 
oligonucleotide hybridization, selective amplification, and selective primer extension. 
[00205] Gene amplification and/or expression can be measured in a sample
directly. Alternatively, antibodies can be employed that can recognize specific
duplexes, including DNA duplexes, RNA duplexes, and DNA-RNA hybrid duplexes 
or DNA-protein duplexes. The antibodies in turn can be labeled and the assay can be 
carried out where the duplex is bound to a surface, so that upon the formation of 
duplex on the surface, the presence of antibody bound to the duplex can be detected. 
[00206] Gene expression, alternatively, can be measured by immunological
methods, such as immunohistochemical staining of cells or tissue sections and assay
of cell culture or body fluids, to quantitate directly the expression of gene product.
Antibodies useful for immunohistochemical staining and/or assay of sample fluids can 
be either monoclonal or polyclonal, and can be prepared in any mammal. 
Conveniently, the antibodies can be prepared against a native sequence polypeptide or 
against a synthetic peptide based on the DNA sequences provided herein or against 
exogenous sequence fused to DNA and encoding a specific antibody epitope. 
[00207] The nucleic acid molecules are also useful for testing an individual for
a genotype that, while not necessarily causing the disease, nevertheless affects the
treatment modality. Thus, the nucleic acid molecules can be used to study the 
relationship between an individual's genotype and the individual's response to a 
compound used for treatment (pharmacogenomic relationship). Accordingly, the 
nucleic acid molecules described herein can be used to assess the mutation content of 
the gene in an individual in order to select an appropriate compound or dosage 
regimen for treatment. 
[00208] Thus nucleic acid molecules displaying genetic variations that affect
treatment provide a diagnostic target that can be used to tailor treatment in an
individual. Accordingly, the production of recombinant cells and animals containing
these polymorphisms allows effective clinical design of treatment compounds and
dosage regimens. 

[00209] The present invention is directed to antisense compounds, particularly
oligonucleotides, which are targeted to a nucleic acid encoding a polypeptide of the
present invention, and which modulate the expression of the polypeptide of the 
present invention. Pharmaceutical and other compositions comprising the antisense 
compounds of the invention are also provided. Further provided are methods of 
modulating the expression of the polypeptide of the present invention in cells or 
tissues comprising contacting said cells or tissues with one or more of the antisense 
compounds or compositions of the invention. Further provided are methods of treating 
an animal, particularly a human, suspected of having or being prone to a disease or 
condition associated with expression of the polypeptide of the present invention by 
administering a therapeutically or prophylactically effective amount of one or more of 
the antisense compounds or compositions of the invention. 

[00210] The nucleic acid molecules are thus useful as antisense constructs to
control gene expression in cells, tissues, and organisms. A DNA antisense nucleic
acid molecule is designed to be complementary to a region of the gene involved in 
transcription, preventing transcription and hence production of protein. An antisense 
RNA or DNA nucleic acid molecule would hybridize to the mRNA and thus block 
translation of mRNA into protein, if targeted to the initiator AUG codon. 
Alternatively, a DNA antisense molecule can modulate gene expression by 
stimulating RNase H-dependent degradation of the target mRNA. Antisense
oligonucleotides are also anticipated to contain modified nucleic acids, such as
phosphorothioate, polyamide, or morpholino groups, that impart improved properties.
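
By way of illustration only, an antisense DNA oligonucleotide directed at the translation start region can be derived computationally as the reverse complement of a window of the mRNA around the AUG. The sketch below assumes that the first AUG in the supplied sequence is the initiator codon, which would need to be confirmed for any real transcript; the function name and the default window sizes are hypothetical.

```python
# DNA base that pairs with each RNA base of the target.
RNA_TO_ANTISENSE_DNA = {"A": "T", "U": "A", "G": "C", "C": "G"}


def antisense_to_start_codon(mrna, upstream=5, downstream=12):
    """Return a DNA oligo (5'->3') complementary to the region around AUG.

    The window spans `upstream` bases before the first AUG through
    `downstream` bases after it, clipped to the ends of the sequence.
    """
    mrna = mrna.upper().replace("T", "U")  # accept cDNA-style input
    start = mrna.find("AUG")
    if start == -1:
        raise ValueError("no AUG codon found in sequence")
    window_start = max(0, start - upstream)
    window_end = min(len(mrna), start + 3 + downstream)
    target = mrna[window_start:window_end]
    # Reverse the target and pair each base to obtain the antisense strand.
    return "".join(RNA_TO_ANTISENSE_DNA[base] for base in reversed(target))
```

For example, with `upstream=3` and `downstream=3`, the mRNA `GGCACCAUGGCUAGCUUAGGCAAA` gives the target window `ACCAUGGCU` and the antisense oligonucleotide `AGCCATGGT`. Chemical modifications of the backbone are outside the scope of this sketch.
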
[00211] Alternatively, a class of antisense molecules can be used to inactivate
mRNA in order to decrease expression of nucleic acid. Accordingly, these molecules
can treat a disorder characterized by abnormal or undesired nucleic acid expression. 
This technique involves cleavage by means of ribozymes containing nucleotide 
sequences complementary to one or more regions in the mRNA that attenuate the 
ability of the mRNA to be translated. Possible regions include coding regions and 
particularly coding regions corresponding to the catalytic and other functional
activities of the protein, such as substrate binding. 

[00212] In addition, nucleic acids can be used to generate small interfering
RNAs (siRNA) which, when delivered to a cell expressing the target mRNA, can lead
to a decrease in the level of that RNA and thus a decrease in protein expressed by that
RNA (Elbashir et al., Genes and Development 15:188-200, 2001). Generation of
siRNAs can be accomplished using commercially available kits (MEGAscript RNAi
Kit, Ambion, Inc., Austin, TX).

[00213] The nucleic acid molecules also provide vectors for gene therapy in
patients containing cells that are aberrant in gene expression. Thus, recombinant cells,
which include the patient's cells that have been engineered ex vivo and returned to the 
patient, are introduced into an individual where the cells produce the desired protein 
to treat the individual. 

[00214] The invention also encompasses kits for detecting the presence of a 
nucleic acid in a biological sample. For example, the kit can comprise reagents such 
as a labeled or labelable nucleic acid or agent capable of detecting nucleic acid in a 
biological sample; means for determining the amount of nucleic acid in the sample; 
and means for comparing the amount of nucleic acid in the sample with a standard. 
The compound or agent can be packaged in a suitable container. The kit can further 
comprise instructions for using the kit to detect protein mRNA or DNA. 

Vectors/host cells 

[00215] The invention also provides vectors containing the nucleic acid
molecules described herein. The term "vector" refers to a vehicle, preferably a nucleic
acid molecule, which can transport the nucleic acid molecules. When the vector is a 
nucleic acid molecule, the nucleic acid molecules are covalently linked to the vector 
nucleic acid. With this aspect of the invention, the vector includes a plasmid, single 
or double stranded phage, a single or double stranded RNA or DNA viral vector, or 
artificial chromosome, such as a BAC, PAC, YAC, or MAC.
[00216] A vector can be maintained in the host cell as an extrachromosomal 
element where it replicates and produces additional copies of the nucleic acid 
molecules. Alternatively, the vector may integrate into the host cell genome and 
produce additional copies of the nucleic acid molecules when the host cell replicates. 
[00217] The invention provides vectors for the maintenance (cloning vectors)
or vectors for expression (expression vectors) of the nucleic acid molecules. The
vectors can function in prokaryotic or eukaryotic cells or in both (shuttle vectors). 
[00218] Expression vectors contain cis-acting regulatory regions that are
operably linked in the vector to the nucleic acid molecules such that transcription of
the nucleic acid molecules is allowed in a host cell. The nucleic acid molecules can be 
introduced into the host cell with a separate nucleic acid molecule capable of affecting 
transcription. Thus, the second nucleic acid molecule may provide a trans-acting factor
interacting with the cis-regulatory control region to allow transcription of the nucleic
acid molecules from the vector. Alternatively, a trans-acting factor may be supplied
by the host cell. Finally, a trans-acting factor can be produced from the vector itself. It 
is understood, however, that in some embodiments, transcription and/or translation of 
the nucleic acid molecules can occur in a cell-free system. 
[00219] The regulatory sequences to which the nucleic acid molecules 
described herein can be operably linked include promoters for directing mRNA 
transcription. These include, but are not limited to, the left promoter from 
bacteriophage λ, the lac, TRP, and TAC promoters from E. coli, the early and late
promoters from SV40, the CMV immediate early promoter, the adenovirus early and 
late promoters, and retrovirus long-terminal repeats. 

[00220] In addition to control regions that promote transcription, expression
vectors may also include regions that modulate transcription, such as repressor
binding sites and enhancers. Examples include the SV40 enhancer, the 
cytomegalovirus immediate early enhancer, polyoma enhancer, adenovirus enhancers, 
and retrovirus LTR enhancers. 

[00221] In addition to containing sites for transcription initiation and control,
expression vectors can also contain sequences necessary for transcription termination
and, in the transcribed region, a ribosome binding site for translation. Other regulatory
control elements for expression include initiation and termination codons as well as 
polyadenylation signals. The person of ordinary skill in the art would be aware of the 
numerous regulatory sequences that are useful in expression vectors. Such regulatory 
sequences are described, for example, in Sambrook et al., Molecular Cloning: A
Laboratory Manual, 3rd ed., Cold Spring Harbor Laboratory Press, Cold Spring
Harbor, N.Y. (2001).
[00222] A variety of expression vectors can be used to express a nucleic acid
molecule. Such vectors include chromosomal, episomal, and virus-derived vectors,
for example vectors derived from bacterial plasmids, from bacteriophage, from yeast 
episomes, from yeast chromosomal elements, including yeast artificial chromosomes, 
from viruses such as baculoviruses, papovaviruses such as SV40, vaccinia viruses,
adenoviruses, poxviruses, pseudorabies viruses, and retroviruses. Vectors may also be 
derived from combinations of these sources such as those derived from plasmid and 
bacteriophage genetic elements, e.g. cosmids and phagemids. Appropriate cloning and 
expression vectors for prokaryotic and eukaryotic hosts are described in Sambrook et 
al., Molecular Cloning: A Laboratory Manual, 3rd ed., Cold Spring Harbor
Laboratory Press, Cold Spring Harbor, N.Y., (2001). 

[00223] The regulatory sequence may provide constitutive expression in one or
more host cells (i.e., tissue specific) or may provide for inducible expression in one or
more cell types such as by temperature, nutrient additive, or exogenous factor such as 
a hormone or other ligand. A variety of vectors providing for constitutive and 
inducible expression in prokaryotic and eukaryotic hosts are well known to those of 
ordinary skill in the art. 

[00224] The nucleic acid molecules can be inserted into the vector nucleic acid
by well-known methodology. Generally, the DNA sequence that will ultimately be
expressed is joined to an expression vector by cleaving the DNA sequence and the 
expression vector with one or more restriction enzymes and then ligating the 
fragments together. Procedures for restriction enzyme digestion and ligation are well 
known to those of ordinary skill in the art.

[00225] The vector containing the appropriate nucleic acid molecule can be
introduced into an appropriate host cell for propagation or expression using
well-known techniques. Bacterial cells include, but are not limited to, E. coli,
Streptomyces, and Salmonella typhimurium. Eukaryotic cells include, but are not 
limited to, yeast, insect cells such as Drosophila, animal cells such as COS and CHO 
cells, and plant cells. 

[00226] As described herein, it may be desirable to express the peptide as a
fusion protein. Accordingly, the invention provides fusion vectors that allow for the
production of the peptides. Fusion vectors can increase the expression of a
recombinant protein, increase the solubility of the recombinant protein, and aid in the
purification of the protein by acting, for example, as a ligand for affinity purification. A
proteolytic cleavage site may be introduced at the junction of the fusion moiety so that
the desired peptide can ultimately be separated from the fusion moiety. Proteolytic
enzymes include, but are not limited to, factor Xa, thrombin, and enterokinase. Typical
fusion expression vectors include pGEX (Smith et al., Gene 67:31-40 (1988)), pMAL
(New England Biolabs, Beverly, Mass.) and pRIT5 (Pharmacia, Piscataway, N.J.),
which fuse glutathione S-transferase (GST), maltose E binding protein, or protein A,
respectively, to the target recombinant protein. Examples of suitable inducible
non-fusion E. coli expression vectors include pTrc (Amann et al., Gene 69:301-315
(1988)) and pET 11d (Studier et al., Gene Expression Technology: Methods in
Enzymology 185:60-89 (1990)).

[00227] Recombinant protein expression can be maximized in host bacteria by
providing a genetic background wherein the host cell has an impaired capacity to
proteolytically cleave the recombinant protein (Gottesman, S., Gene Expression
Technology: Methods in Enzymology 185, Academic Press, San Diego, Calif.
(1990) 119-128). Alternatively, the sequence of the nucleic acid molecule of interest
can be altered to provide preferential codon usage for a specific host cell, for example
E. coli (Wada et al., Nucleic Acids Res. 20:2111-2118 (1992)).
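
Altering a coding sequence for preferential codon usage amounts to replacing each codon with a synonymous codon favored by the host while leaving the encoded peptide unchanged. The sketch below shows the idea on a deliberately small scale. The codon-to-amino-acid assignments follow the standard genetic code, but the `PREFERRED_CODON` table covers only a few amino acids and its choices are illustrative placeholders, not measured usage frequencies for any host; a real application would use a full usage table such as those compiled in the reference cited above.

```python
# Partial standard genetic code (only the codons used in this example).
CODON_TO_AMINO_ACID = {
    "ATG": "M",
    "CTG": "L", "CTA": "L", "TTA": "L",
    "CGT": "R", "AGA": "R", "AGG": "R",
    "AAA": "K", "AAG": "K",
    "TAA": "*", "TAG": "*",
}

# Hypothetical host preference: one chosen codon per amino acid.
PREFERRED_CODON = {"M": "ATG", "L": "CTG", "R": "CGT", "K": "AAA", "*": "TAA"}


def translate(coding_sequence):
    """Translate an in-frame coding sequence using the partial table."""
    coding_sequence = coding_sequence.upper()
    if len(coding_sequence) % 3 != 0:
        raise ValueError("sequence length must be a multiple of three")
    codons = [coding_sequence[i:i + 3] for i in range(0, len(coding_sequence), 3)]
    try:
        return "".join(CODON_TO_AMINO_ACID[codon] for codon in codons)
    except KeyError as missing:
        raise ValueError(f"codon not in example table: {missing}") from None


def recode_for_host(coding_sequence):
    """Swap every codon for the preferred synonymous codon."""
    return "".join(PREFERRED_CODON[amino_acid] for amino_acid in translate(coding_sequence))
```

Because every substitution is synonymous, translating the recoded sequence returns the same peptide as the original, which is a useful check after any such rewrite.
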
[00228] The nucleic acid molecules can also be expressed by expression
vectors that are operative in yeast. Examples of vectors for expression in yeast, e.g., S.
cerevisiae, include pYepSecl (Baldari, et al., EMBO J. 6:229-234 (1987)), pMFa
(Kurjan et al., Cell 30:933-943 (1982)), pJRY88 (Schultz et al., Gene 54:113-123
(1987)), and pYES2 (Invitrogen Corporation, San Diego, Calif.).
[00229] The nucleic acid molecules can also be expressed in insect cells using,
for example, baculovirus expression vectors. Baculovirus vectors available for
expression of proteins in cultured insect cells (e.g., Sf9 cells) include the pAc series
(Smith et al., Mol. Cell Biol. 3:2156-2165 (1983)) and the pVL series (Lucklow et al.,
Virology 170:31-39 (1989)).

[00230] In certain embodiments of the invention, the nucleic acid molecules
described herein are expressed in mammalian cells using mammalian expression
vectors. Examples of mammalian expression vectors include pCDM8 (Seed, B.,
Nature 329:840 (1987)) and pMT2PC (Kaufman et al., EMBO J. 6:187-195 (1987)).
[00231] The expression vectors listed herein are provided by way of example
only of the well-known vectors available to those of ordinary skill in the art that
would be useful to express the nucleic acid molecules. The person of ordinary skill in
the art would be aware of other vectors suitable for maintenance, propagation, or
expression of the nucleic acid molecules described herein. These are found, for
example, in Sambrook, J., Fritsh, E. F., and Maniatis, T., Molecular Cloning: A
Laboratory Manual, 3rd ed., Cold Spring Harbor Laboratory, Cold Spring Harbor
Laboratory Press, Cold Spring Harbor, N.Y., 2001.

[00232] The invention also encompasses vectors in which the nucleic acid
sequences described herein are cloned into the vector in reverse orientation, but
operably linked to a regulatory sequence that permits transcription of antisense RNA. 
Thus, an antisense transcript can be produced to all, or to a portion, of the nucleic acid 
molecule sequences described herein, including both coding and non-coding regions. 
Expression of this antisense RNA is subject to each of the parameters described above 
in relation to expression of the sense RNA (regulatory sequences, constitutive or 
inducible expression, tissue-specific expression). 

[00233] The invention also relates to recombinant host cells containing the
vectors described herein. Host cells therefore include prokaryotic cells, lower 
eukaryotic cells such as yeast, other eukaryotic cells such as insect cells, and higher 
eukaryotic cells such as mammalian cells. 

[00234] The recombinant host cells are prepared by introducing the vector
constructs described herein into the cells by techniques readily available to the person
of ordinary skill in the art. These include, but are not limited to, calcium phosphate
transfection, DEAE-dextran-mediated transfection, cationic lipid-mediated
transfection, electroporation, transduction, infection, lipofection, and other techniques
such as those found in Sambrook, et al. (Molecular Cloning: A Laboratory Manual,
3rd ed., Cold Spring Harbor Laboratory, Cold Spring Harbor Laboratory Press, Cold
Spring Harbor, N.Y., 2001).

[00235] Host cells can contain more than one vector. Thus, different nucleotide
sequences can be introduced on different vectors of the same cell. Similarly, the
nucleic acid molecules can be introduced either alone or with other nucleic acid
molecules that are not related to the nucleic acid molecules such as those providing
trans-acting factors for expression vectors. When more than one vector is introduced
into a cell, the vectors can be introduced independently, co-introduced or joined to the
nucleic acid molecule vector.

[00236] In the case of bacteriophage and viral vectors, these can be introduced
into cells as packaged or encapsulated virus by standard procedures for infection and
transduction. Viral vectors can be replication-competent or replication-defective. In 
the case in which viral replication is defective, replication will occur in host cells 
providing functions that complement the defects. 

[00237] Vectors generally include selectable markers that enable the selection
of the subpopulation of cells that contain the recombinant vector constructs. The
marker can be contained in the same vector that contains the nucleic acid molecules 
described herein or may be on a separate vector. Markers include tetracycline or 
ampicillin-resistance genes for prokaryotic host cells and dihydrofolate reductase or 
neomycin resistance for eukaryotic host cells. However, any marker that provides 
selection for a phenotypic trait will be effective. 

[00238] While the mature proteins can be produced in bacteria, yeast, 
mammalian cells, and other cells under the control of the appropriate regulatory 
sequences, cell-free transcription and translation systems can also be used to produce 
these proteins using RNA derived from the DNA constructs described herein. 
[00239] Where secretion of the peptide is desired, which is difficult to achieve
with multi-transmembrane domain containing proteins, appropriate secretion signals
are incorporated into the vector. The signal sequence can be endogenous to the 
peptides or heterologous to these peptides. 

[00240] Where the peptide is not secreted into the medium, the protein can be
isolated from the host cell by standard disruption procedures, including freeze thaw,
sonication, mechanical disruption, use of lysing agents and the like. The peptide can 
then be recovered and purified by well-known purification methods including 
ammonium sulfate precipitation, acid extraction, anion or cation exchange
chromatography, phosphocellulose chromatography, hydrophobic-interaction 
chromatography, affinity chromatography, hydroxylapatite chromatography, lectin 
chromatography, or high performance liquid chromatography. 
[00241] It is also understood that depending upon the host cell in recombinant 
production of the peptides described herein, the peptides can have various 
glycosylation patterns, depending upon the cell, or may be non-glycosylated as when
produced in bacteria. In addition, the peptides may include an initial modified
methionine in some cases as a result of a host-mediated process. 

Uses of vectors and host cells 

[00242] The recombinant host cells expressing the proteins described herein
have a variety of uses. First, the cells are useful for producing protein that can be
further purified to produce desired amounts of protein. Thus, host cells containing 
expression vectors are useful for protein production. 

[00243] Host cells are also useful for conducting cell-based assays involving
the protein or protein fragments, such as those described above as well as other
formats known in the art. Thus, a recombinant host cell expressing a native protein is 
useful for assaying compounds that stimulate or inhibit protein function. 
[00244] Host cells are also useful for identifying protein mutants in which these
functions are affected. If the mutants naturally occur and give rise to a pathology, host
cells containing the mutations are useful to assay compounds that have a desired 
effect on the mutant protein (for example, stimulating or inhibiting function) which 
may not be indicated by their effect on the native protein. 

Transgenic animals 

[00245] Genetically engineered host cells can be further used to produce
non-human transgenic animals. A transgenic animal is preferably a mammal, for example
a rodent, such as a rat or mouse, in which one or more of the cells of the animal 
include a transgene. A transgene is exogenous DNA which is integrated into the 
genome of a cell from which a transgenic animal develops and which remains in the 
genome of the mature animal in one or more cell types or tissues of the transgenic 
animal. These animals are useful for studying the function of a protein and identifying 
and evaluating modulators of protein activity. Other examples of transgenic animals 
include non-human primates, sheep, dogs, cows, goats, chickens, and amphibians. 
[00246] A transgenic animal can be produced by introducing nucleic acid into
the male pronuclei of a fertilized oocyte, e.g., by microinjection or retroviral infection,
and allowing the oocyte to develop in a pseudopregnant female foster animal. Any of
the protein nucleotide sequences can be introduced as a transgene into the genome of 
a non-human animal, such as a mouse. 
[00247] Any of the regulatory or other sequences useful in expression vectors
can form part of the transgenic sequence. This includes intronic sequences and
polyadenylation signals, if not already included. A tissue-specific regulatory 
sequence(s) can be operably linked to the transgene to direct expression of the protein 
to particular cells. 

[00248] Methods for generating transgenic animals via embryo manipulation
and microinjection, particularly animals such as mice, have become conventional in
the art and are described, for example, in U.S. Pat. Nos. 4,736,866 and 4,870,009,
both by Leder et al., U.S. Pat. No. 4,873,191 by Wagner et al., and in Hogan, B.,
Manipulating the Mouse Embryo (Cold Spring Harbor Laboratory Press, Cold Spring
Harbor, N.Y., 1986). Similar methods are used for production of other transgenic
animals. A transgenic founder animal can be identified based upon the presence of the 
transgene in its genome and/or expression of transgenic mRNA in tissues or cells of 
the animals. A transgenic founder animal can then be used to breed additional animals 
carrying the transgene. Moreover, transgenic animals carrying a transgene can further 
be bred to other transgenic animals carrying other transgenes. A transgenic animal 
also includes animals in which the entire animal or tissues in the animal have been 
produced using the homologously recombinant host cells described herein. 
[00249] In another embodiment, transgenic non-human animals can be
produced which contain selected systems that allow for regulated expression of the
transgene. One example of such a system is the cre/loxP recombinase system of
bacteriophage P1. For a description of the cre/loxP recombinase system, see, e.g.,
Lakso et al. PNAS 89:6232-6236 (1992). Another example of a recombinase system is
the FLP recombinase system of S. cerevisiae (O'Gorman et al. Science 251:1351-1355
(1991)). If a cre/loxP recombinase system is used to regulate expression of the
transgene, animals containing transgenes encoding both the Cre recombinase and a
selected protein are required. Such animals can be provided through the construction of
"double" transgenic animals, e.g., by mating two transgenic animals, one containing a 
transgene encoding a selected protein and the other containing a transgene encoding a 
recombinase. 

[00250] Clones of the non-human transgenic animals described herein can also
be produced according to the methods described in Wilmut, I. et al. Nature 385:810-813
(1997) and PCT International Publication Nos. WO 97/07668 and WO 97/07669.
In brief, a cell, e.g., a somatic cell, from the transgenic animal can be isolated and
induced to exit the growth cycle and enter G0 phase. The quiescent cell can then be
fused, e.g., through the use of electrical pulses, to an enucleated oocyte from an
animal of the same species from which the quiescent cell is isolated. The
reconstructed oocyte is then cultured such that it develops to morula or blastocyst and
then transferred to a pseudopregnant female foster animal. The offspring born of this
female foster animal will be a clone of the animal from which the cell, e.g., the 
somatic cell, is isolated. 

[00251] Transgenic animals containing recombinant cells that express the
peptides described herein are useful to conduct the assays described herein in an in
vivo context. The various physiological factors that are present in vivo
and that could affect substrate binding and protein activation may not be evident
from in vitro cell-free or cell-based assays. Accordingly, it is useful to provide non-human
transgenic animals to assay in vivo protein function, including substrate
interaction, the effect of specific mutant proteins on protein function and substrate 
interaction, and the effect of chimeric proteins. It is also possible to assess the effect 
of null mutations, that is mutations that substantially or completely eliminate one or 
more protein functions. 

[00252] All publications and patents mentioned in the above specification are
herein incorporated by reference. Various modifications and variations of the
described method and system of the invention will be apparent to those skilled in the 
art without departing from the scope and spirit of the invention. Although the 
invention has been described in connection with specific preferred embodiments, it 
should be understood that the invention as claimed should not be unduly limited to 
such specific embodiments. Indeed, various modifications of the above-described
modes for carrying out the invention which are obvious to those skilled in the field of 
molecular biology or related fields are intended to be within the scope of the 
following claims. 

Microarray Construction and Element Selection

[00253] A chemical coupling procedure and an ink jet device can be used to
synthesize array elements on the surface of a substrate. (See, e.g., Gamble et al., U.S.
Pat. No. 5,981,733.) An array analogous to a dot or slot blot may also be used to arrange and
link elements to the surface of a substrate using thermal, UV, mechanical, or
chemical bonding procedures, or a vacuum system. A typical array may be produced
by hand or using available methods and machines and contain any appropriate number
of elements. After hybridization, nonhybridized probes are removed and a scanner is
used to determine the levels and patterns of fluorescence. The degree of
complementarity and the relative abundance of each probe which hybridizes to an 
element on the microarray may be assessed through analysis of the scanned images. 
[00254] In another alternative, full-length cDNAs or Expressed Sequence Tags
(ESTs) comprise the elements of the microarray. Full-length cDNAs or ESTs
representing a genome scan of thousands of unique nucleotide sequences, or
corresponding to one of the nucleotide sequences of the present invention, are
arranged on an appropriate substrate, e.g., a glass slide. The cDNA is fixed to the slide
using, e.g., UV cross-linking followed by thermal and chemical treatments and subsequent
drying. (See, e.g., Schena, M. et al. (1995) Science 270:467-470; and Shalon, D. et al.
(1996) Genome Res. 6:639-645.) Fluorescent probes are prepared and used for
hybridization to the elements on the substrate.

[00255] Probe sequences for microarrays may be selected by screening a large
number of clones from a variety of cDNA libraries in order to find sequences with
conserved protein motifs common to genes coding for signal sequence containing
polypeptides. In one embodiment, sequences identified from cDNA libraries are
analyzed to identify those gene sequences with conserved protein motifs using an
appropriate analysis program, e.g., the Block 2 Bioanalysis Program (Incyte, Palo
Alto, Calif.). This motif analysis program, based on sequence information contained
in the SWISS-PROT database and PROSITE, is a method of determining the function of
uncharacterized proteins translated from genomic or cDNA sequences. (See, e.g.,
Bairoch, A. et al. (1997) Nucleic Acids Res. 25:217-221; and Attwood, T. K. et al.
(1997) J. Chem. Inf. Comput. Sci. 37:417-424.) PROSITE may be used to identify
functional or structural domains that cannot be detected using conserved motifs due to
extreme sequence divergence. The method is based on weight matrices. Motifs
identified by this method are then calibrated against the SWISS-PROT database in
order to obtain a measure of the chance distribution of the matches.
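
By way of illustration only, weight-matrix scoring of the kind referred to above can be sketched as follows. The sketch assumes a simple log-odds matrix built from a gapless alignment with a uniform background and a pseudocount; the function names and the toy alignment are hypothetical, and the sketch is not the Block 2 or PROSITE implementation.

```python
import math
from collections import Counter

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"

def build_weight_matrix(aligned, alphabet=AMINO_ACIDS, pseudocount=1.0):
    """Build one log2-odds column per alignment position.

    Assumes equal-length, gapless sequences and a uniform background
    frequency (both are simplifying assumptions of this sketch).
    """
    background = 1.0 / len(alphabet)
    total = len(aligned) + pseudocount * len(alphabet)
    matrix = []
    for pos in range(len(aligned[0])):
        counts = Counter(seq[pos] for seq in aligned)
        column = {
            residue: math.log2(((counts[residue] + pseudocount) / total) / background)
            for residue in alphabet
        }
        matrix.append(column)
    return matrix

def score(matrix, window):
    """Sum the position-specific scores of a window as wide as the matrix."""
    return sum(column[residue] for column, residue in zip(matrix, window))

def best_match(matrix, sequence):
    """Return (score, start index) of the highest-scoring window."""
    width = len(matrix)
    return max(
        (score(matrix, sequence[i:i + width]), i)
        for i in range(len(sequence) - width + 1)
    )

# Toy alignment (made up for illustration).
toy_sites = ["GKSGT", "GKTGS", "GKSGS", "GKSGT"]
toy_matrix = build_weight_matrix(toy_sites)
```

Calling `best_match(toy_matrix, "MAGKSGTLL")` locates the window that best fits the toy motif. Calibration against a sequence database, as described above, would then compare such scores with the scores obtained by chance.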

[00256] In another embodiment, Hidden Markov models (HMMs) may be used
to find shared motifs, specifically consensus sequences. (See, e.g., Pearson, W. R. and
D. J. Lipman (1988) Proc. Natl. Acad. Sci. USA 85:2444-2448; and Smith, T. F. and
M. S. Waterman (1981) J. Mol. Biol. 147:195-197.) HMMs were initially developed
to examine speech recognition patterns, but are now being used in a biological context 
to analyze protein and nucleic acid sequences as well as to model protein structure. 
(See, e.g., Krogh, A. et al. (1994) J. Mol. Biol. 235:1501-1531; and Collin, M. et al. 
(1993) Protein Sci. 2:305-314.) HMMs have a formal probabilistic basis and use 
position-specific scores for amino acids or nucleotides. The algorithm continues to 
incorporate information from newly identified sequences to increase its motif analysis 
capabilities. 

Microarray Uses

[00257] Polynucleotide sequences are particularly useful when they are
hybridizable array elements in a microarray. Such a microarray can be employed to 
monitor the expression of genes of unknown function, but which are differentially 
expressed in precancerous or cancerous tissue. In addition, the microarray can be used 
to monitor the expression of genes with a known function in tumor biology. 
[00258] The microarray can be used for large scale genetic or gene expression
analysis of a large number of polynucleotide sequences. The microarray can be used
in the diagnosis of diseases, such as in the diagnosis of early stages of ductal 
carcinoma before other definitive symptoms are evident, and in the differential 
diagnosis of diseases with similar symptoms. The microarray can also be used in the 
monitoring and evaluation of treatments where altered expression of genes coding for 
polypeptides implicated in the control of cell proliferation causes disease, such as
cancer. Additionally, the microarray can be used to investigate an individual's 
predisposition to a disease, such as cancer. Furthermore, the microarray can be 
employed to investigate cellular responses, such as cell proliferation and the like. 
[00259] When the polynucleotide sequences are employed as hybridizable
array elements in a microarray, the array elements are organized in an ordered fashion
so that each element is present at a specified location on the substrate. Because the 
array elements are at specified locations on the substrate, the hybridization patterns 
and intensities (which together create a unique expression profile) can be interpreted 
in terms of expression levels of particular genes and can be correlated with a 
particular disease or condition or treatment. 
[00260] With a large enough number of transcript profiles derived from
different biological samples, a statistically significant correlation can emerge between
cell and tissue source information, such as disease states, treatment outcomes, 
exposure to various environmental factors or genotypes, and the expression levels of 
particular genes or groups of genes. Comparisons between transcript profiles of 
different cells or tissues or of the same cells or tissues under different conditions can 
be used to discern differences in transcriptional activities. For example, a transcript 
profile can show differences occurring between two different tissues, such as liver and 
prostate; between normal and diseased tissue, such as normal breast and breast tumor; or
between untreated and treated tissues, such as prostate tumor and irradiated prostate 
tumor. 

[00261] In another embodiment of the present invention, kits can be generated
which contain the necessary reagents to carry out the assays of the present invention.
[00262] Specifically, the invention provides a compartmentalized kit to receive,
in close confinement, one or more containers, which comprises: (a) a first container
comprising one of the nucleic acid molecules that can bind to a fragment of the
human genome disclosed herein; and (b) one or more other containers comprising one
or more of the following: wash reagents and reagents capable of detecting the presence of a
bound nucleic acid.

[00263] In detail, a compartmentalized kit includes any kit in which reagents
are contained in separate containers. Such containers include small glass containers,
plastic containers, strips of plastic, glass or paper, or arraying material such as silica.
Such containers allow one to efficiently transfer reagents from one compartment to
another compartment such that the samples and reagents are not cross-contaminated, 
and the agents or solutions of each container can be added in a quantitative fashion 
from one compartment to another. Such containers will include a container which will
accept the test sample, a container which contains the nucleic acid probe, containers
which contain wash reagents (such as phosphate buffered saline, Tris-buffers, etc.),
and containers which contain the reagents used to detect the bound probe. One skilled
in the art will readily recognize that the previously unidentified genes of the present
invention, which can be routinely identified using the sequence information disclosed herein,
can be readily incorporated into one of the established kit formats which are well
known in the art, particularly expression arrays.
RNA Isolation, Amplification, and Labeling for Microarray

[00264] RNA can be isolated from the sample according to any of a number of
methods well known to those of skill in the art. For example, methods of purification
of nucleic acids are described in Laboratory Techniques in Biochemistry and
Molecular Biology: Hybridization With Nucleic Acid Targets, Part I. Theory and
Nucleic Acid Preparation, P. Tijssen, ed. Elsevier (1993). When sample
polynucleotides are amplified it is desirable to amplify the nucleic acid sample and
maintain the relative abundances of the original sample, including low abundance
transcripts. Total mRNA can be amplified by reverse transcription using a reverse
transcriptase and a primer consisting of oligo d(T) and a sequence encoding the phage T7
promoter to provide a single stranded DNA template. The second DNA strand is
polymerized using a DNA polymerase and an RNase which assists in breaking up the
DNA/RNA hybrid. After synthesis of the double stranded DNA, T7 RNA polymerase
can be added and RNA transcribed from the second DNA strand template (Van
Gelder et al. U.S. Pat. No. 5,545,522). RNA can be amplified in vitro, in situ or in
vivo (See Eberwine U.S. Pat. No. 5,514,545).

[00265] The polynucleotides may be labeled with one or more labeling 
moieties to allow for detection of hybridized polynucleotide complexes. The labeling 
moieties can include compositions that can be detected by spectroscopic, 
photochemical, biochemical, bioelectronic, immunochemical, electrical, optical or 
chemical means. The labeling moieties include radioisotopes, chemiluminescent 
compounds, labeled binding proteins, heavy metal atoms, spectroscopic markers, such 
as fluorescent markers and dyes, magnetic labels, linked enzymes, mass spectrometry 
tags, spin labels, electron transfer donors and acceptors, and the like. 

Hybridization and Analysis of Microarrays

[00266] Hybridization causes a denatured polynucleotide and a denatured
sample polynucleotide to form a stable duplex through base pairing. Hybridization
methods are well known to those skilled in the art. (See, for example, Laboratory
Techniques in Biochemistry and Molecular Biology, Vol. 24: Hybridization With
Nucleic Acid Targets, P. Tijssen, ed. Elsevier, N.Y. (1993).) Hybridization conditions
can be defined by salt concentration, temperature, and other chemicals and conditions
well known in the art. In particular, stringency can be increased by reducing the
concentration of salt, or raising the hybridization temperature.
[00267] For example, stringent salt concentration will ordinarily be less than
about 750 mM NaCl and 75 mM trisodium citrate, preferably less than about 500 mM
NaCl and 50 mM trisodium citrate, and most preferably less than about 250 mM NaCl
and 25 mM trisodium citrate. Stringent temperature conditions will ordinarily include
temperatures of at least about 30° C, more preferably of at least about 37° C, and
most preferably of at least about 60° C. Varying additional parameters, such as
hybridization time, the concentration of detergent or solvent, and the inclusion or
exclusion of carrier DNA, are well known to those skilled in the art. Additional
variations on these conditions will be readily apparent to those skilled in the art
(Wahl, G. M. and S. L. Berger (1987) Methods Enzymol. 152:399-407; Kimmel, A.
R. (1987) Methods Enzymol. 152:507-511; Ausubel, F. M. et al. (1997) Short
Protocols in Molecular Biology, John Wiley & Sons, New York, N.Y.; and
Sambrook, J. et al. (2001) Molecular Cloning: A Laboratory Manual, Cold Spring
Harbor Press, Plainview, N.Y.).
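
The salt limits recited above are in the 10:1 NaCl to trisodium citrate ratio of saline-sodium citrate (SSC) buffer, in which 1x SSC is 150 mM NaCl and 15 mM trisodium citrate, so that 750 mM/75 mM corresponds to 5x SSC. The following sketch is illustrative only. It assumes, for simplicity, that the salt and temperature limits are applied together as paired tiers, and it treats the "about" values as hard cutoffs; the function and tier names are hypothetical.

```python
SSC_1X_NACL_MM = 150.0      # NaCl in 1x SSC
SSC_1X_CITRATE_MM = 15.0    # trisodium citrate in 1x SSC

def ssc_fold(nacl_mm, citrate_mm):
    """Express a NaCl/citrate pair as a multiple of 1x SSC."""
    nacl_fold = nacl_mm / SSC_1X_NACL_MM
    citrate_fold = citrate_mm / SSC_1X_CITRATE_MM
    if abs(nacl_fold - citrate_fold) > 1e-9:
        raise ValueError("salts are not in the 10:1 SSC ratio")
    return nacl_fold

# (label, NaCl must be below, citrate must be below, temperature at least)
# Pairing each salt limit with a temperature limit is an assumption of this sketch.
TIERS = [
    ("most preferred", 250.0, 25.0, 60.0),
    ("preferred", 500.0, 50.0, 37.0),
    ("ordinary", 750.0, 75.0, 30.0),
]

def stringency_tier(nacl_mm, citrate_mm, temp_c):
    """Return the strictest tier whose salt and temperature limits are all met."""
    for label, max_nacl, max_citrate, min_temp in TIERS:
        if nacl_mm < max_nacl and citrate_mm < max_citrate and temp_c >= min_temp:
            return label
    return "not stringent"
```

For example, a wash in 1x SSC at 65° C falls in the strictest tier, whereas a wash in 6x SSC does not meet any of the recited salt limits regardless of temperature.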

[00268] Hybridization reactions can be performed in a differential
hybridization format. In a differential hybridization format, the differential expression
of a set of genes in two biological samples is analyzed. For differential hybridization, 
polynucleotides from both biological samples are prepared and labeled with different 
labeling moieties. A mixture of the two labeled polynucleotides is added to a 
microarray. The microarray is then examined under conditions in which the emissions 
from the two different labels are individually detectable. Polynucleotides in the 
microarray that are hybridized to substantially equal numbers of polynucleotides 
derived from both biological samples give a distinct combined fluorescence (Shalon 
et al. PCT publication WO 95/35505). In a preferred embodiment, the fluorophores
Cy3 and Cy5 (Amersham Pharmacia Biotech, Piscataway N.J.) are employed as 
labels. 

[00269] After hybridization, the microarray is washed to remove nonhybridized
nucleic acids and complex formation between the hybridizable array elements and the
polynucleotides is detected. Methods for detecting complex formation are well known 
to those skilled in the art. 
[00270] In a differential hybridization experiment, polynucleotides from two or
more different biological samples are labeled with two or more different fluorescent
labels with different emission wavelengths. Fluorescent signals are detected
separately with different photomultipliers set to detect specific wavelengths. The
relative abundances/expression levels of the polynucleotides in two or more samples
are obtained.
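
As a minimal sketch of how the separately detected signals can be turned into relative expression levels, the two channel intensities for each array element may be compared as a log2 ratio. The example assumes background-subtracted intensities, a floor to avoid division by zero, and a two-fold call threshold; the function names, the floor, and the threshold are illustrative assumptions rather than part of the disclosed method.

```python
import math

def log2_ratio(test_signal, reference_signal, floor=1.0):
    """log2(test/reference) for one array element; signals are clamped to `floor`."""
    return math.log2(max(test_signal, floor) / max(reference_signal, floor))

def call_element(test_signal, reference_signal, threshold=1.0):
    """Label an element as up, down, or unchanged at a log2 threshold (1.0 = two-fold)."""
    ratio = log2_ratio(test_signal, reference_signal)
    if ratio >= threshold:
        return "up"
    if ratio <= -threshold:
        return "down"
    return "unchanged"
```

An element that hybridizes to substantially equal amounts of both labeled samples gives a ratio near zero, consistent with the combined fluorescence described above.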

[00271] Typically, microarray fluorescence intensities can be normalized to
take into account variations in hybridization intensities when more than one
microarray is used under similar test conditions. In a preferred embodiment, 
individual polynucleotide complex hybridization intensities are normalized using the 
intensities derived from internal normalization controls contained on each microarray. 
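
One simple way to carry out such a normalization, given here only as a sketch, is to divide every element intensity on an array by the mean intensity of that array's internal control elements. The element identifiers and the choice of the mean as the scaling statistic are assumptions made for illustration.

```python
def normalize_to_controls(intensities, control_ids):
    """Scale all intensities on one array by the mean of its control elements.

    intensities: dict mapping element id -> raw intensity for a single array.
    control_ids: ids of the internal normalization controls on that array.
    """
    control_mean = sum(intensities[c] for c in control_ids) / len(control_ids)
    if control_mean <= 0:
        raise ValueError("control elements gave no usable signal")
    return {element: value / control_mean for element, value in intensities.items()}

# Two hypothetical arrays hybridized under similar conditions; the second is
# uniformly dimmer, which the control-based scaling removes.
array_a = {"ctrl1": 200.0, "ctrl2": 200.0, "geneX": 800.0}
array_b = {"ctrl1": 100.0, "ctrl2": 100.0, "geneX": 400.0}
norm_a = normalize_to_controls(array_a, ["ctrl1", "ctrl2"])
norm_b = normalize_to_controls(array_b, ["ctrl1", "ctrl2"])
```

After scaling, the same element can be compared across arrays even when overall hybridization intensity differs.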
[00272] The microarray can be used to monitor the expression level of large
numbers of genes simultaneously and to identify splice variants, mutations, and
polymorphisms. This information may be used to determine gene function, to 
understand the genetic basis of a disease, to diagnose a disease, and to develop and 
monitor the activities of therapeutic agents. 

mRNA Expression Analysis Using SYBR Green

[00273] Where possible, microarray results were confirmed using the SYBR
green procedure. The SYBR green PCR procedure is a way to perform real-time PCR
using the SYBR Green I dye. Direct detection of polymerase chain reaction (PCR)
product is monitored by measuring the increase in fluorescence caused by the binding
of SYBR green dye to double stranded DNA. Gene specific PCR oligonucleotide
primer pairs were designed using the Primer Express 1.5 software (Applied
Biosystems, Foster City, CA).

[00274] One microgram of each mRNA is added to 100 µL of a reverse
transcriptase reaction using the ABI Taqman reverse transcription reagents with
random hexamers according to the manufacturer's protocol (Applied Biosystems,
Foster City, CA). The thermal cycling conditions included 1 cycle at 25°C for 10
minutes, 1 cycle at 48°C for 30 minutes and 1 cycle at 95°C for 5 minutes. Four
hundred microliters of water is then added to the cDNA reaction. The resulting
cDNA (10 µL) is added to a 25 µL SYBR green PCR reaction mixture according to
the manufacturer's protocol (Applied Biosystems, Foster City, CA). The thermal
cycling conditions included 1 cycle at 95°C for ten minutes, followed by 40 cycles of 95°C
for 15 s and annealing at 60°C for 1 minute. Data are expressed as the fold increase normalized
to the same gene using the delta delta CT method for relative quantitation. For
comparison, data obtained with cDNA from colon and breast tumors were compared to
data from pools of normal colon and breast cDNA.
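
The arithmetic behind the steps above can be sketched as follows. The delta delta CT calculation shown is the commonly used form, fold change = 2^(-ΔΔCT), which assumes approximately 100% amplification efficiency for both amplicons; the use of a separate reference gene for normalization, the function names, and the example CT values are assumptions made for illustration and are not taken from the specification.

```python
def rna_equivalent_ng(rna_ug=1.0, rt_volume_ul=100.0, water_added_ul=400.0,
                      template_ul=10.0):
    """Input RNA (ng) represented by the cDNA aliquot added to one PCR reaction."""
    total_volume_ul = rt_volume_ul + water_added_ul
    return rna_ug * 1000.0 / total_volume_ul * template_ul

def fold_change_ddct(ct_target_tumor, ct_ref_tumor, ct_target_normal, ct_ref_normal):
    """Relative expression in tumor versus normal by the delta delta CT method.

    Delta CT = CT(target) - CT(reference) within each sample;
    delta delta CT = delta CT(tumor) - delta CT(normal);
    fold change = 2 ** (-delta delta CT), assuming ~100% PCR efficiency.
    """
    delta_ct_tumor = ct_target_tumor - ct_ref_tumor
    delta_ct_normal = ct_target_normal - ct_ref_normal
    delta_delta_ct = delta_ct_tumor - delta_ct_normal
    return 2.0 ** (-delta_delta_ct)
```

With the volumes recited above, each 10 µL aliquot of diluted cDNA represents 20 ng of input RNA. As a hypothetical example, a target that crosses threshold three cycles earlier in tumor than in normal cDNA, with an unchanged reference, corresponds to an eight-fold increase.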

Antibodies 

[00275] The present invention further provides antibodies to the proteins of the
present invention. Exemplary antibodies include polyclonal, monoclonal, humanized,
bispecific, and heteroconjugate antibodies. 

Polyclonal Antibodies 

[00276] The antibodies can comprise polyclonal antibodies. Methods of
preparing polyclonal antibodies are known to the skilled artisan. Polyclonal antibodies
can be raised in a mammal, for example, by one or more injections of an immunizing 
agent and, if desired, an adjuvant. Typically, the immunizing agent and/or adjuvant 
will be injected in the mammal by multiple subcutaneous or intraperitoneal injections. 
The immunizing agent can include the polypeptide or a fusion protein thereof. It can 
be useful to conjugate the immunizing agent to a protein known to be immunogenic in 
the mammal being immunized. Examples of such immunogenic proteins include but 
are not limited to keyhole limpet hemocyanin, serum albumin, bovine thyroglobulin, 
and soybean trypsin inhibitor. Examples of adjuvants that can be employed include 
Freund's complete adjuvant and MPL-TDM adjuvant (monophosphoryl Lipid A, 
synthetic trehalose dicorynomycolate). Alternatively, genetic immunization may be a
useful approach to generating antibodies without having to purify the protein of
interest (Kilpatrick et al., 17:569-576, 1998). The immunization protocol can be 
selected by one skilled in the art without undue experimentation. 

Monoclonal Antibodies 

[00277] The antibodies can, alternatively, be monoclonal antibodies.
Monoclonal antibodies can be prepared using hybridoma methods, such as those
described by Kohler and Milstein, Nature, 256:495 (1975). In a hybridoma method, a 
mouse, hamster, or other appropriate host animal, is typically immunized with an 
immunizing agent to elicit lymphocytes that produce or are capable of producing
antibodies that will specifically bind to the immunizing agent. Alternatively, the 
lymphocytes can be immunized in vitro. 

[00278] The immunizing agent will typically include the polypeptide or a
fusion protein thereof. Generally, either peripheral blood lymphocytes ("PBLs") are
used if cells of human origin are desired, or spleen cells or lymph node cells are used 
if non-human mammalian sources are desired. The lymphocytes are then fused with 
an immortalized cell line using a suitable fusing agent, such as polyethylene glycol, to 
form a hybridoma cell (Goding, Monoclonal Antibodies: Principles and Practice,
Academic Press, (1986) pp. 59-103). Immortalized cell lines are usually transformed 
mammalian cells, particularly myeloma cells of rodent, bovine, and human origin. 
Usually, rat or mouse myeloma cell lines are employed. The hybridoma cells can be 
cultured in a suitable culture medium that preferably contains one or more substances 
that inhibit the growth or survival of the unfused immortalized cells. For example, if 
the parental cells lack the enzyme hypoxanthine guanine phosphoribosyl transferase 
(HGPRT or HPRT), the culture medium for the hybridomas typically will include 
hypoxanthine, aminopterin, and thymidine ("HAT medium"), which substances 
prevent the growth of HGPRT-deficient cells.

[00279] Preferred immortalized cell lines are those that fuse efficiently, support
stable high level expression of antibody by the selected antibody-producing cells, and
are sensitive to a medium such as HAT medium. More preferred immortalized cell 
lines are murine myeloma lines, which can be obtained, for instance, from the Salk 
Institute Cell Distribution Center, San Diego, California and the American Type 
Culture Collection, Manassas, Virginia. Human myeloma and mouse-human 
heteromyeloma cell lines also have been described for the production of human 
monoclonal antibodies (Kozbor, J. Immunol., 133:3001 (1984); Brodeur et al.,
Monoclonal Antibody Production Techniques and Applications, Marcel Dekker, Inc., 
New York, (1987) pp. 51-63). 

[00280] The culture medium in which the hybridoma cells are cultured can then 
be assayed for the presence of monoclonal antibodies directed against a protein of the 
present invention. Preferably, the binding specificity of monoclonal antibodies 
produced by the hybridoma cells is determined by immunoprecipitation or by an in 
vitro binding assay, such as radioimmunoassay (RIA) or enzyme-linked 
immunosorbent assay (ELISA). Such techniques and assays are known in the art.
The binding affinity of the monoclonal antibody can, for example, be determined by
the Scatchard analysis of Munson and Pollard, Anal. Biochem., 107:220 (1980).
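
For orientation, a classical linear Scatchard transformation can be sketched as follows. It assumes a single class of independent binding sites, for which bound/free = (Bmax - bound)/Kd, so that a straight-line fit of bound/free against bound has slope -1/Kd and x-intercept Bmax. This is a simplified illustration with hypothetical names and synthetic data; it is not the curve-fitting procedure of the cited reference.

```python
def scatchard_fit(bound, free):
    """Estimate (Kd, Bmax) from paired bound and free ligand concentrations.

    Fits bound/free versus bound by ordinary least squares, assuming a
    single class of binding sites.
    """
    xs = list(bound)
    ys = [b / f for b, f in zip(bound, free)]
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    kd = -1.0 / slope
    bmax = -intercept / slope
    return kd, bmax

# Synthetic, noise-free data generated with Kd = 2 and Bmax = 10 (arbitrary units).
free_conc = [0.5, 1.0, 2.0, 4.0, 8.0]
bound_conc = [10.0 * f / (2.0 + f) for f in free_conc]
kd_est, bmax_est = scatchard_fit(bound_conc, free_conc)
```

With real binding data the points scatter around the line, and curvature may indicate more than one class of site, in which case a nonlinear fit is generally more appropriate.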
[00281] After the desired hybridoma cells are identified, the clones can be
subcloned by limiting dilution procedures and grown by standard methods (Goding,
supra). Suitable culture media for this purpose include, for example, Dulbecco's
Modified Eagle's Medium and RPMI-1640 medium. Alternatively, the hybridoma cells
can be grown in vivo as ascites in a mammal. The monoclonal antibodies secreted by 
the subclones can be isolated or purified from the culture medium or ascites fluid by 
conventional immunoglobulin purification procedures such as, for example, protein 
A-Sepharose, hydroxylapatite chromatography, gel electrophoresis, dialysis, or 
affinity chromatography. 

[00282] The monoclonal antibodies can also be made by recombinant DNA
methods, such as those described in U.S. Patent No. 4,816,567. DNA encoding the
monoclonal antibodies of the invention can be readily isolated and sequenced using 
conventional procedures (e.g., by using oligonucleotide probes that are capable of 
binding specifically to genes encoding the heavy and light chains of murine 
antibodies). The hybridoma cells of the invention serve as a preferred source of such 
DNA. Once isolated, the DNA can be placed into expression vectors, which are then 
transfected into host cells such as simian COS cells, Chinese hamster ovary (CHO) 
cells, or myeloma cells that do not otherwise produce immunoglobulin protein, to 
obtain the synthesis of monoclonal antibodies in the recombinant host cells. The DNA 
also can be modified, for example, by substituting the coding sequence for human 
heavy and light chain constant domains in place of the homologous murine sequences 
(U.S. Patent No. 4,816,567; Morrison et al., PNAS, 81:6851-6855 (1984)) or by
covalently joining to the immunoglobulin coding sequence all or part of the coding 
sequence for a non-immunoglobulin polypeptide. Such a non-immunoglobulin 
polypeptide can be substituted for the constant domains of an antibody of the 
invention, or can be substituted for the variable domains of one antigen-combining 
site of an antibody of the invention to create a chimeric bivalent antibody. The 
antibodies can be monovalent antibodies. Methods for preparing monovalent 
antibodies are well known in the art. For example, one method involves recombinant 
expression of immunoglobulin light chain and modified heavy chain. The heavy chain 
is truncated generally at any point in the Fc region so as to prevent heavy chain
crosslinking. Alternatively, the relevant cysteine residues are substituted with another 
amino acid residue or are deleted so as to prevent crosslinking. 

[00283] In vitro methods are also suitable for preparing monovalent antibodies.
Digestion of antibodies to produce fragments thereof, particularly Fab fragments, can
be accomplished using routine techniques known in the art. 

Human and Humanized Antibodies 

[00284] The antibodies of the invention can further comprise humanized
antibodies or human antibodies. Humanized forms of non-human (e.g., murine)
antibodies are chimeric immunoglobulins, immunoglobulin chains or fragments
thereof (such as Fv, Fab, Fab', F(ab')2 or other antigen-binding subsequences of
antibodies) which contain minimal sequence derived from non-human
immunoglobulin. Humanized antibodies include human immunoglobulins (recipient
antibody) in which residues from a complementarity determining region (CDR) of the
recipient are replaced by residues from a CDR of a non-human species (donor 
antibody) such as mouse, rat, or rabbit having the desired specificity, affinity and 
capacity. In some instances, Fv framework residues of the human immunoglobulin are 
replaced by corresponding nonhuman residues. Humanized antibodies can also 
comprise residues which are found neither in the recipient antibody nor in the
imported CDR or framework sequences. In general, the humanized antibody will 
comprise substantially all of at least one, and typically two, variable domains, in 
which all or substantially all of the CDR regions correspond to those of a nonhuman 
immunoglobulin and all or substantially all of the framework regions (FR) are those 
of a human immunoglobulin consensus sequence. The humanized antibody optimally 
also will comprise at least a portion of an immunoglobulin constant region (Fc), 
typically that of a human immunoglobulin (Jones et al., Nature, 321:522-525 (1986);
Riechmann et al., Nature, 332:323-329 (1988); and Presta, Curr. Op. Struct.
Biol., 2:593-596 (1992)).

[00285] Methods for humanizing non-human antibodies are well known in the 
art. Generally, a humanized antibody has one or more amino acid residues introduced 
into it from a source that is non-human. These non-human amino acid residues are 
often referred to as "import" residues, which are typically taken from an "import" 
variable domain. Humanization can be essentially performed following the method of
Winter and co-workers (Jones et al., Nature, 321:522-525 (1986); Riechmann et al.,
Nature, 332:323-327 (1988); Verhoeyen et al., Science, 239:1534-1536 (1988)), by
substituting rodent CDRs or CDR sequences for the corresponding sequences of a 
human antibody. Accordingly, such "humanized" antibodies are chimeric antibodies 
(U.S. Patent No. 4,816,567), wherein substantially less than an intact human variable 
domain has been substituted by the corresponding sequence from a non-human 
species. In practice, humanized antibodies are typically human antibodies in which 
some CDR residues and possibly some FR residues are substituted by residues from 
analogous sites in rodent antibodies. 

[00286] Human antibodies can also be produced using various techniques
known in the art, including phage display libraries (Hoogenboom and Winter, J. Mol.
Biol., 227:381 (1991); Marks et al., J. Mol. Biol., 222:581 (1991)). The techniques of
Cole et al. and Boerner et al. are also available for the preparation of human
monoclonal antibodies (Cole et al., Monoclonal Antibodies and Cancer Therapy,
Alan R. Liss, p. 77 (1985) and Boerner et al., J. Immunol., 147(1):86-95 (1991)).
Similarly, human antibodies can be made by introducing human immunoglobulin
loci into transgenic animals, e.g., mice in which the endogenous immunoglobulin
genes have been partially or completely inactivated. Upon challenge, human antibody 
production is observed, which closely resembles that seen in humans in all respects, 
including gene rearrangement, assembly, and antibody repertoire. This approach is 
described, for example, in U.S. Patent Nos. 5,545,807; 5,545,806; 5,569,825; 
5,625,126; 5,633,425; 5,661,016, and in the following scientific publications: Marks 
et al., Bio/Technology 10, 779-783 (1992); Lonberg et al., Nature 368, 856-859 (1994); 
Morrison, Nature 368, 812-13 (1994); Fishwild et al., Nature Biotechnology 14, 
845-51 (1996); Neuberger, Nature Biotechnology 14, 826 (1996); Lonberg and Huszar, 
Intern. Rev. Immunol. 13, 65-93 (1995); Tomizuka et al., PNAS 97, 722-727 (2000). 

Bispecific Antibodies 

[00287] Bispecific antibodies are monoclonal, preferably human or humanized, 
antibodies that have binding specificities for at least two different antigens. In the 
present case, one of the binding specificities is for a protein of the present invention; 
the other one is for any other antigen, and preferably for a cell-surface protein or 
receptor or receptor subunit. 

[00288] Methods for making bispecific antibodies are known in the art. 
Traditionally, the recombinant production of bispecific antibodies is based on the co- 
expression of two immunoglobulin heavy-chain/light-chain pairs, where the two 
heavy chains have different specificities (Milstein and Cuello, Nature, 305:537-539 
(1983)). Because of the random assortment of immunoglobulin heavy and light 
chains, these hybridomas (quadromas) produce a potential mixture of ten different 
antibody molecules, of which only one has the correct bispecific structure. The 
purification of the correct molecule is usually accomplished by affinity 
chromatography steps. Similar procedures are disclosed in WO 93/08829, published 
13 May 1993, and in Traunecker et al., EMBO J., 10:3655-3659 (1991). 
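
The figure of ten antibody species can be checked with a short counting sketch. It is offered only as an illustration and is not taken from the cited references: each antibody is modeled as an unordered pair of heavy/light half-molecules, and the chain labels H1, H2, L1 and L2 are hypothetical names, with H1/L1 and H2/L2 assumed to be the cognate pairs.

```python
from itertools import combinations_with_replacement, product

heavy = ["H1", "H2"]  # hypothetical labels for the two heavy chains
light = ["L1", "L2"]  # hypothetical labels for the two light chains

# A half-molecule is one heavy chain paired with one light chain (4 kinds).
half_molecules = list(product(heavy, light))

# A full antibody is an unordered pair of half-molecules.
antibodies = list(combinations_with_replacement(half_molecules, 2))

# Assumed cognate pairing: H1 with L1 and H2 with L2.
bispecific = (("H1", "L1"), ("H2", "L2"))
correct = [ab for ab in antibodies if ab == bispecific]

print(len(antibodies), "possible molecules,", len(correct), "correctly paired bispecific")
```

Under these assumptions only one of the enumerated combinations carries both cognate heavy/light pairs, which is why purification of the desired molecule is needed.
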
[00289] Antibody variable domains with the desired binding specificities 
(antibody-antigen combining sites) can be fused to immunoglobulin constant domain 
sequences. The fusion preferably is with an immunoglobulin heavy-chain constant 
domain, comprising at least part of the hinge, CH2, and CH3 regions. It is preferred to 
have the first heavy-chain constant region (CH1) containing the site necessary for 
light-chain binding present in at least one of the fusions. DNAs encoding the 
immunoglobulin heavy-chain fusions and, if desired, the immunoglobulin light chain, 
are inserted into separate expression vectors, and are cotransfected into a suitable host 
organism. For further details of generating bispecific antibodies see, for example, 
Suresh et al., Methods in Enzymology, 121:210 (1986). 

[00290] According to another approach described in WO 96/27011, the 
interface between a pair of antibody molecules can be engineered to maximize the 
percentage of heterodimers, which are recovered from recombinant cell culture. The 
preferred interface comprises at least a part of the CH3 region of an antibody constant 
domain. In this method, one or more small amino acid side chains from the interface 
of the first antibody molecule are replaced with larger side chains (e.g., tyrosine or 
tryptophan). Compensatory "cavities" of identical or similar size to the large side 
chain(s) are created on the interface of the second antibody molecule by replacing 
large amino acid side chains with smaller ones (e.g., alanine or threonine). This 
provides a mechanism for increasing the yield of the heterodimer over other unwanted 
end products such as homodimers. 

[00291] Bispecific antibodies can be prepared as full-length antibodies or 
antibody fragments (e.g., F(ab')2 bispecific antibodies). Techniques for generating 
bispecific antibodies from antibody fragments have been described in the literature. 
For example, bispecific antibodies can be prepared using chemical 
linkage. Brennan et al., Science 229:81 (1985) describe a procedure wherein intact 
antibodies are proteolytically cleaved to generate F(ab')2 fragments. These fragments 
are reduced in the presence of the dithiol complexing agent sodium arsenite to 
stabilize vicinal dithiols and prevent intermolecular disulfide formation. The Fab' 
fragments generated are then converted to thionitrobenzoate (TNB) derivatives. One 
of the Fab'-TNB derivatives is then reconverted to the Fab'-thiol by reduction with 
mercaptoethylamine and is mixed with an equimolar amount of the other Fab'-TNB 
derivative to form the bispecific antibody. The bispecific antibodies produced can be 
used as agents for the selective immobilization of enzymes. 

[00292] Fab' fragments can be directly recovered from E. coli and chemically 
coupled to form bispecific antibodies. Shalaby et al., J. Exp. Med. 175:217-225 
(1992) describe the production of a fully humanized bispecific antibody F(ab')2 
molecule. Each Fab' fragment was separately secreted from E. coli and subjected to 
directed chemical coupling in vitro to form the bispecific antibody. The bispecific 
antibody thus formed was able to bind to cells overexpressing the ErbB2 receptor and 
normal human T cells, as well as trigger the lytic activity of human cytotoxic 
lymphocytes against human breast tumor targets. 

[00293] Various techniques for making and isolating bispecific antibody 
fragments directly from recombinant cell culture have also been described. For 
example, bispecific antibodies have been produced using leucine zippers. Kostelny et 
al., J. Immunol. 148(5):1547-1553 (1992). The leucine zipper peptides from the Fos 
and Jun proteins were linked to the Fab' portions of two different antibodies by gene 
fusion. The antibody homodimers were reduced at the hinge region to form monomers 
and then re-oxidized to form the antibody heterodimers. This method can also be 
utilized for the production of antibody homodimers. The "diabody" technology 
described by Hollinger et al., Proc. Natl. Acad. Sci. USA 90:6444-6448 (1993) has 
provided an alternative mechanism for making bispecific antibody fragments. The 
fragments comprise a heavy-chain variable domain (VH) connected to a light-chain 
variable domain (VL) by a linker, which is too short to allow pairing between the two 
domains on the same chain. Accordingly, the VH and VL domains of one fragment are 
forced to pair with the complementary VL and VH domains of another fragment, 
thereby forming two antigen-binding sites. Another strategy for making bispecific 
antibody fragments by the use of single-chain Fv (sFv) dimers has also been reported. 
See Gruber et al., J. Immunol. 152:5368 (1994). 

[00294] Antibodies with more than two valencies are contemplated. For 
example, trispecific antibodies can be prepared. Tutt et al., J. Immunol. 147:60 
(1991). Exemplary bispecific antibodies can bind to two different epitopes on a given 
polypeptide herein. Alternatively, an arm can be combined with an arm which binds 
to a triggering molecule on a leukocyte such as a T-cell receptor molecule (e.g., CD2, 
CD3, CD28, or B7), or Fc receptors for IgG (FcγR), such as FcγRI (CD64), FcγRII 
(CD32) and FcγRIII (CD16) so as to focus cellular defense mechanisms to the cell 
expressing the particular protein of the present invention. Bispecific antibodies can 
also be used to localize cytotoxic agents to cells which express a particular protein 
of the present invention. These antibodies possess a binding arm to a protein of the 
present invention and an arm which binds a cytotoxic agent or a radionuclide 
chelator, such as EOTUBE, DTPA, DOTA, or TETA. Another bispecific antibody of 
interest binds the polypeptide and further binds tissue factor (TF). 

Heteroconjugate Antibodies 

[00295] Heteroconjugate antibodies are also within the scope of the present 

invention. Heteroconjugate antibodies are composed of two covalently joined 
antibodies. Such antibodies have, for example, been proposed to target immune 
system cells to unwanted cells (U.S. Patent No. 4,676,980), and for treatment of HIV 
infection (WO 91/00360; WO 92/200373; EP 03089). It is contemplated that the 
antibodies can be prepared in vitro using known methods in synthetic protein 
chemistry, including those involving crosslinking agents. For example, immunotoxins 
can be constructed using a disulfide exchange reaction or by forming a thioether bond. 
Examples of suitable reagents for this purpose include iminothiolate, methyl-4- 
mercaptobutyrimidate, and those disclosed, for example, in U.S. Patent No. 
4,676,980. 

Effector Function Engineering 

[00296] It can be desirable to modify the antibody of the invention with respect 

to effector function, so as to enhance, e.g., the effectiveness of the antibody in treating 
cancer. For example, cysteine residue(s) can be introduced into the Fc region, thereby 
allowing interchain disulfide bond formation in this region. The homodimeric 
antibody thus generated can have improved internalization capability and/or increased 
complement-mediated cell killing and antibody-dependent cellular cytotoxicity 
(ADCC). See Caron et al., J. Exp. Med., 176:1191-1195 (1992) and Shopes, J. 
Immunol., 148:2918-2922 (1992). Homodimeric antibodies with enhanced anti-tumor 
activity can also be prepared using heterobifunctional cross-linkers as described in 
Wolff et al., Cancer Research, 53:2560-2565 (1993). Alternatively, an antibody can 
be engineered that has dual Fc regions and can thereby have enhanced complement 
lysis and ADCC capabilities. See Stevenson et al., Anti-Cancer Drug Design, 
3:219-230 (1989). 

Immunoconjugates 

[00297] The invention also pertains to immunoconjugates comprising an 
antibody conjugated to a cytotoxic agent such as a chemotherapeutic agent, toxin 
(e.g., an enzymatically active toxin of bacterial, fungal, plant, or animal origin, or 
fragments thereof), or a radioactive isotope (i.e., a radioconjugate). 

[00298] Chemotherapeutic agents useful in the generation of such 
immunoconjugates have been described above. Enzymatically active toxins and 
fragments thereof that can be used include diphtheria A chain, nonbinding active 
fragments of diphtheria toxin, exotoxin A chain (from Pseudomonas aeruginosa), 
ricin A chain, abrin A chain, modeccin A chain, alpha-sarcin, Aleurites fordii 
proteins, dianthin proteins, Momordica charantia inhibitor, curcin, crotin, Saponaria 
officinalis inhibitor, gelonin, mitogellin, restrictocin, phenomycin, maytansinoids, 
enomycin, and the trichothecenes. A variety of radionuclides are available for the 
production of radioconjugated antibodies. Examples include radioisotopes of Bi, I, In, and Y. 

[00299] Conjugates of the antibody and cytotoxic agent are made using a 
variety of bifunctional protein-coupling agents such as N-succinimidyl-3-(2- 
pyridyldithiol) propionate (SPDP), iminothiolane (IT), bifunctional derivatives of 
imidoesters (such as dimethyl adipimidate HCl), active esters (such as disuccinimidyl 
suberate), aldehydes (such as glutaraldehyde), bis-azido compounds (such as bis(p- 
azidobenzoyl) hexanediamine), bis-diazonium derivatives (such as bis-(p- 
diazoniumbenzoyl)-ethylenediamine), diisocyanates (such as toluene 2,6- 
diisocyanate), and bis-active fluorine compounds (such as 1,5-difluoro-2,4- 
dinitrobenzene). For example, a ricin immunotoxin can be prepared as described in 
Vitetta et al., Science, 238:1098 (1987). Carbon-14-labeled 1-isothiocyanatobenzyl- 
3-methyldiethylene triaminepentaacetic acid (MX-DTPA) is an exemplary chelating 
agent for conjugation of radionuclide to the antibody. See WO 94/11026. 

[00300] In another embodiment, the antibody can be conjugated to a "receptor" 
(such as streptavidin) for utilization in tumor pretargeting wherein the antibody-receptor 
conjugate is administered to the patient, followed by removal of unbound conjugate 
from the circulation using a clearing agent and then administration of a "ligand" (e.g., 
avidin) that is conjugated to a cytotoxic agent (e.g., a radionuclide). 

Immunoliposomes 

[00301] The antibodies disclosed herein can also be formulated as 
immunoliposomes. Liposomes containing the antibody are prepared by methods 
known in the art, such as described in Epstein et al., Proc. Natl. Acad. Sci. USA, 82: 
3688 (1985); Hwang et al., Proc. Natl. Acad. Sci. USA, 77:4030 (1980); and U.S. Pat. 
Nos. 4,485,045 and 4,544,545. Liposomes with enhanced circulation time are 
disclosed in U.S. Patent No. 5,013,556. 

[00302] Particularly useful liposomes can be generated by the reverse-phase 

evaporation method with a lipid composition comprising phosphatidylcholine, 
cholesterol, and PEG derivatized phosphatidylethanolamine (PEG-PE). Liposomes 
are extruded through filters of defined pore size to yield liposomes with the desired 
diameter. Fab' fragments of the antibody of the present invention can be conjugated to 
the liposomes as described in Martin et al., J. Biol. Chem., 257: 286-288 (1982) via a 
disulfide-interchange reaction. A chemotherapeutic agent (such as Doxorubicin) is 
optionally contained within the liposome. See Gabizon et al., J. National Cancer Inst., 
81(19):1484 (1989). 

Methods of Detecting Proteins and nucleic acids 

[00303] The invention also provides a method for detecting the presence or 

absence of a protein of the present invention in a biological sample. The method 
includes obtaining a biological sample from a test subject and contacting the 
biological sample with a compound or an agent capable of detecting protein or nucleic 
acid (e.g., mRNA, genomic DNA) that encodes protein such that the presence of a 
protein of the present invention is detected in the biological sample. An agent for 
detecting mRNA or genomic DNA is a labeled nucleic acid probe capable of 
hybridizing to mRNA or genomic DNA. The nucleic acid probe can be, for example, 
a full-length nucleic acid, such as the nucleic acid of SEQ ID NOs: 1-11, SEQ ID 
NOs:13-38, SEQ ID NO:3113, or a fragment thereof, such as SEQ ID NOs:77-3011, 
SEQ ID NOs:3012-3083, or an oligonucleotide of at least 15, 30, 50, 100, 250 or 500 
nucleotides in length and sufficient to specifically hybridize under stringent 
conditions to mRNA or genomic DNA. Other suitable probes for use in the diagnostic 
assays of the invention are described herein. 

[00304] An agent for detecting protein is an antibody capable of binding to 

protein, preferably an antibody with a detectable label. Antibodies can be polyclonal, 
or more preferably, monoclonal. An intact antibody, or a fragment thereof (e.g., Fab or 
F(ab')2), can be used. The term "labeled", with regard to the probe or antibody, is 
intended to encompass direct labeling of the probe or antibody by coupling (i.e., 
physically linking) a detectable substance to the probe or antibody, as well as indirect 
labeling of the probe or antibody by reactivity with another reagent that is directly 
labeled. Examples of indirect labeling include detection of a primary antibody using a 
fluorescently labeled secondary antibody and end-labeling of a DNA probe with 
biotin such that it can be detected with fluorescently labeled streptavidin. The term 
"biological sample" is intended to include tissues, cells, and biological fluids isolated 
from a subject, as well as tissues, cells and fluids present within a subject. That is, the 
detection method of the invention can be used to detect mRNA, protein, or genomic 
DNA in a biological sample in vitro as well as in vivo. For example, in vitro 
techniques for detection of mRNA include Northern hybridizations and in situ 
hybridizations. In vitro techniques for detection of protein include enzyme linked 
immunosorbent assays (ELISAs), Western blots, immunoprecipitations and 
immunofluorescence. In vitro techniques for detection of genomic DNA include 
Southern hybridizations. Furthermore, in vivo techniques for detection of protein 
include introducing into a subject a labeled antibody. For example, the antibody can 
be labeled with a radioactive marker whose presence and location in a subject can be 
detected by standard imaging techniques. 

[00305] In one embodiment, the biological sample contains protein molecules 

from the test subject. Alternatively, the biological sample can contain mRNA 
molecules from the test subject or genomic DNA molecules from the test subject. 
[00306] In another embodiment, the methods further involve obtaining a 

control biological sample from a control subject, contacting the control sample with a 
compound or agent capable of detecting protein, mRNA, or genomic DNA, such that 
the presence of protein, mRNA or genomic DNA is detected in the biological sample, 
and comparing the presence of protein, mRNA or genomic DNA in the control sample 
with the presence of protein, mRNA or genomic DNA in the test sample. 

Uses for Ab in diagnostics and affinity purification 

[00307] The antibodies to the proteins of the invention have various utilities. 
For example, antibodies can be used in diagnostic assays for a protein of the present 
invention, e.g., detecting its expression in specific cells, tissues, or serum. Various 
diagnostic assay techniques known in the art can be used, such as competitive binding 
assays, direct or indirect sandwich assays and immunoprecipitation assays conducted 
in either heterogeneous or homogeneous phases (Zola, Monoclonal Antibodies: A 
Manual of Techniques, CRC Press, Inc. (1987) pp. 147-158). The antibodies used in 
the diagnostic assays can be labeled with a detectable moiety. The detectable moiety 
should be capable of producing, either directly or indirectly, a detectable signal. For 
example, the detectable moiety can be a radioisotope, such as 3H, 14C, 32P, 35S, or 125I, 
a fluorescent or chemiluminescent compound, such as fluorescein isothiocyanate, 
rhodamine, or luciferin, or an enzyme, such as alkaline phosphatase, beta- 
galactosidase or horseradish peroxidase. Any method known in the art for conjugating 
the antibody to the detectable moiety can be employed, including those methods 
described by Hunter et al., Nature, 144:945 (1962); David et al., Biochemistry, 
13:1014 (1974); Pain et al., J. Immunol. Meth., 40:219 (1981); and Nygren, J. 
Histochem. and Cytochem., 30:407 (1982). 

[00308] A competition assay may be employed wherein antibodies specific to 
the polypeptide are attached to a solid support, and the labeled polypeptide and a 
sample derived from the host are passed over the solid support. The amount of 
label detected on the solid support can then be correlated to the quantity of 
polypeptide in the sample. 
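
One way such a correlation might be carried out is by interpolating on a standard curve, sketched below. The sketch assumes that unlabeled polypeptide in the sample competes with the labeled polypeptide, so bound label falls as sample polypeptide rises. The function name `estimate_quantity` and all numeric values are hypothetical and are not taken from this specification.

```python
def estimate_quantity(signal, standards):
    """Interpolate polypeptide quantity from the bound-label signal.

    standards: (quantity, signal) pairs measured with known amounts of
    unlabeled polypeptide; signal is assumed to fall as quantity rises.
    """
    pts = sorted(standards, key=lambda p: p[1])  # ascending signal
    for (q_a, s_a), (q_b, s_b) in zip(pts, pts[1:]):
        if s_a <= signal <= s_b:
            frac = (signal - s_a) / (s_b - s_a)
            return q_a + frac * (q_b - q_a)  # linear interpolation
    raise ValueError("signal lies outside the standard curve")


# Hypothetical standard curve: (ng polypeptide, counts of bound label).
standards = [(0.0, 1000.0), (10.0, 600.0), (100.0, 200.0)]
print(estimate_quantity(800.0, standards))
```

A real assay would typically fit a nonlinear (for example, four-parameter logistic) curve; linear interpolation between adjacent standards is used here only to keep the sketch short.
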

[00309] Antibodies to a protein of the present invention also are useful for the 
affinity purification of a protein of the present invention from recombinant cell culture 
or natural sources. In this process, the antibodies are immobilized on a suitable 
support, such as a Sephadex resin or filter paper, using methods well known in the art. 
The immobilized antibody then is contacted with a sample containing a protein of the 
present invention to be purified, and thereafter the support is washed with a suitable 
solvent that will remove substantially all the material in the sample except the 
protein of the present invention, which is bound to the immobilized antibody. Finally, 
the support is washed with another suitable solvent that will release the protein of the 
present invention from the antibody. 

Binding assays 

[00310] In binding assays, the interaction is binding and the complex formed 

can be isolated or detected in the reaction mixture. In a particular embodiment, the 
polypeptide encoded by the gene identified herein or the drug candidate is 
immobilized on a solid phase, e.g., on a microtiter plate, by covalent or non-covalent 
attachments. Non-covalent attachment generally is accomplished by coating the solid 
surface with a solution of the polypeptide and drying. Alternatively, an immobilized 
antibody, e.g., a monoclonal antibody, specific for the polypeptide to be immobilized 
can be used to anchor it to a solid surface. The assay is performed by adding the non- 
immobilized component, which can be labeled by a detectable label, to the 
immobilized component, e.g., the coated surface containing the anchored component. 
When the reaction is complete, the non-reacted components are removed, e.g., by 
washing, and complexes anchored on the solid surface are detected. When the 
originally non-immobilized component carries a detectable label, the detection of 
label immobilized on the surface indicates that complexing occurred. Where the 
originally non-immobilized component does not carry a label, complexing can be 
detected, for example, by using a labeled antibody specifically binding the 
immobilized complex. 

[00311] If the candidate compound interacts with but does not bind to a 

particular polypeptide encoded by a gene identified herein, its interaction with that 
polypeptide can be assayed by methods well known for detecting protein-protein 
interactions. Such assays include traditional approaches, such as, e.g., cross-linking, 
co-immunoprecipitation, and co-purification through gradients or chromatographic 
columns. In addition, protein-protein interactions can be monitored by using a yeast- 
based genetic system described by Fields and co-workers (Fields and Song, Nature 
(London), 340:245-246 (1989); Chien et al., Proc. Natl. Acad. Sci. USA, 88:9578- 
9582 (1991)) as disclosed by Chevray and Nathans, Proc. Natl. Acad. Sci. USA, 89: 
5789-5793 (1991). Many transcriptional activators, such as yeast GAL4, consist of 
two physically discrete modular domains, one acting as the DNA-binding domain, 
and the other one functioning as the transcription-activation domain. The yeast 
expression system described in the foregoing publications (generally referred to as the 
"two-hybrid system") takes advantage of this property, and employs two hybrid 
proteins, one in which the target protein is fused to the DNA-binding domain of 
GAL4, and another, in which candidate activating proteins are fused to the activation 
domain. The expression of a GAL4-lacZ reporter gene under control of a GAL4- 
activated promoter depends on reconstitution of GAL4 activity via protein-protein 
interaction. Colonies containing interacting polypeptides are detected with a 
chromogenic substrate for β-galactosidase. A complete kit (MATCHMAKER™) for 
identifying protein-protein interactions between two specific proteins using the two- 
hybrid technique is commercially available from Clontech. This system can also be 
extended to map protein domains involved in specific protein interactions as well as 
to pinpoint amino acid residues that are crucial for these interactions. 
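
The reporter logic of the two-hybrid system can be summarized in a small sketch: the reporter is expressed only when one hybrid carries the DNA-binding domain, the other carries the activation domain, and the two fused proteins interact. The class `Hybrid`, the function `reporter_expressed`, and the protein names are hypothetical labels introduced for illustration, and the set of interacting pairs stands in for what the experiment would actually reveal.

```python
from dataclasses import dataclass

DNA_BINDING = "DNA-binding"
ACTIVATION = "activation"


@dataclass(frozen=True)
class Hybrid:
    protein: str      # fused protein (hypothetical names below)
    gal4_domain: str  # GAL4 domain the protein is fused to


def reporter_expressed(bait, prey, interacting_pairs):
    """Return True when GAL4 activity would be reconstituted."""
    domains_complete = {bait.gal4_domain, prey.gal4_domain} == {DNA_BINDING, ACTIVATION}
    proteins_interact = frozenset((bait.protein, prey.protein)) in interacting_pairs
    return domains_complete and proteins_interact


# Assumed ground truth for the sketch: only candidate_A binds the target.
interacting_pairs = {frozenset(("target", "candidate_A"))}
bait = Hybrid("target", DNA_BINDING)
preys = [Hybrid("candidate_A", ACTIVATION), Hybrid("candidate_B", ACTIVATION)]

# Colonies that would score positive with the chromogenic substrate.
positive = [p.protein for p in preys if reporter_expressed(bait, p, interacting_pairs)]
print(positive)
```
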

Identification of receptor/binding protein 

[00312] Antagonists can be detected by combining the polypeptide and a 

potential antagonist with membrane-bound polypeptide receptors or recombinant 
receptors or a binding protein under appropriate conditions for a competitive 
inhibition assay. The polypeptide can be labeled, such as by radioactivity, such that 
the number of polypeptide molecules bound to the receptor or binding protein can be 
used to determine the effectiveness of the potential antagonist. The gene encoding the 
receptor or binding protein can be identified by numerous methods known to those of 
skill in the art, for example, ligand panning and FACS sorting (Coligan et al., Current 
Protocols in Immun., 1(2): Chapter 5 (1991)). 

[00313] Preferably, expression cloning is employed wherein polyadenylated 

RNA is prepared from a cell responsive to the polypeptide and a cDNA library 
created from this RNA is divided into pools and used to transfect COS cells or other 
cells that are not responsive to or do not contain binding protein activity to the 
polypeptide. Transfected cells that are grown on glass slides are exposed to labeled 
polypeptide or lysates are prepared for testing binding activity. The polypeptide can 
be labeled by a variety of means including iodination or inclusion of a recognition site 
for a site-specific protein kinase. Following fixation and incubation, the slides are 
subjected to autoradiographic analysis. Positive pools are identified and sub-pools are 
prepared and re-transfected using an iterative sub-pooling and re-screening process, 
eventually yielding a single clone that encodes the putative receptor or binding 
protein. As an alternative approach for receptor or binding protein identification, 
labeled polypeptide can be photoaffinity-linked with cell membrane or extract 
preparations that express or contain the receptor or binding protein. Cross-linked 
material is resolved by PAGE and exposed to X-ray film. The labeled complex can be 
excised, resolved into peptide fragments, and subjected to protein microsequencing. 
The amino acid sequence obtained from microsequencing can be used to design a 
set of degenerate oligonucleotide probes to screen a cDNA library to identify the gene 
encoding the putative receptor or binding protein. 
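
The sub-pooling and re-screening procedure described above can be pictured as a repeated narrowing of a positive pool. The sketch below is an illustration under simplifying assumptions: exactly one clone in the library is positive, and the callable `pool_is_positive` stands in for the transfection and autoradiographic screen. The names `find_positive_clone`, `pool_is_positive` and `n_subpools` are hypothetical.

```python
def find_positive_clone(clones, pool_is_positive, n_subpools=4):
    """Narrow a positive pool to a single clone by repeated sub-pooling.

    Returns the clone and the number of screening rounds performed.
    """
    if n_subpools < 2:
        raise ValueError("need at least two sub-pools per round")
    pool = list(clones)
    if not pool or not pool_is_positive(pool):
        raise ValueError("starting pool is not positive")
    rounds = 0
    while len(pool) > 1:
        size = -(-len(pool) // n_subpools)  # ceiling division
        subpools = [pool[i:i + size] for i in range(0, len(pool), size)]
        # Re-screen each sub-pool and keep the first one that scores positive.
        pool = next(sp for sp in subpools if pool_is_positive(sp))
        rounds += 1
    return pool[0], rounds


# Hypothetical library of 100 clones in which clone37 encodes the receptor.
library = [f"clone{i}" for i in range(100)]
clone, rounds = find_positive_clone(library, lambda sp: "clone37" in sp)
print(clone, rounds)
```
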

[00314] More specific examples of potential antagonists include a polypeptide 

that binds to the fusions of immunoglobulin with polypeptide, and, in particular, 
antibodies including, without limitation, poly- and monoclonal antibodies and 
antibody fragments, single-chain antibodies, anti-idiotypic antibodies, and chimeric or 
humanized versions of such antibodies or fragments, as well as human antibodies and 
antibody fragments or conjugated antibodies. Alternatively, a potential antagonist can 
be a closely related protein, for example, a mutated form of the polypeptide that 
recognizes the receptor or binding protein but imparts no effect, thereby competitively 
inhibiting the action of the polypeptide. 






Inhibitors of binding interactions 

[00315] When the coding sequences for a polypeptide encode a protein which 

binds to another protein, the polypeptide can be used in assays to identify the other 
proteins or molecules involved in the binding interaction. By such methods, inhibitors 
of the binding interaction can be identified. Proteins involved in such binding 
interactions can also be used to screen for peptide or small molecule inhibitors or 
agonists of the binding interaction. Also, the receptor of a protein of the present 
invention can be used to isolate correlative ligand(s). Screening assays can be 
designed to find lead compounds that mimic the biological activity of a native protein 
or a receptor for the protein. Such screening assays will include assays amenable to 
high-throughput screening of chemical libraries, making them particularly suitable for 
identifying small molecule drug candidates. Small molecules contemplated include 
both synthetic organic and inorganic compounds. The assays can be performed in a 
variety of formats, including protein-protein binding assays, biochemical screening 
assays, immunoassays and cell based assays, which are well characterized in the art. 
Such high- and ultra-high throughput assays can also be used to test antisense 
molecules. 

Methods for identifying modulators of expression or activity 
Cell based assays 

[00316] The invention provides a method for identifying modulators, i.e., 
candidate or test compounds or agents (e.g., antibodies to the polypeptides encoded 
by nucleic acids of SEQ ID NOs:77-3011, an antisense nucleic acid molecule, 
peptides, a polypeptide agonist, a polypeptide antagonist, peptidomimetics, small 
molecules or other drugs) that bind to the proteins of the present invention or have a 
stimulatory or inhibitory effect on, for example, expression or activity. 

[00317] In one embodiment, the invention provides assays for screening 
candidate or test compounds which bind to or modulate the activity of the membrane- 
bound form of a protein or polypeptide or biologically active portion thereof. The test 
compounds of the present invention can be obtained using any of the numerous 
approaches in combinatorial library methods known in the art, including: biological 
libraries; spatially addressable parallel solid phase or solution phase libraries; 
synthetic library methods requiring deconvolution; the "one-bead one-compound" 
library method; and synthetic library methods using affinity chromatography 
selection. The biological library approach is limited to peptide libraries, while the 
other four approaches are applicable to peptide, non-peptide oligomer, or small 
molecule libraries of compounds (Lam (1997) Anticancer Drug Des 12:145). 

[00318] Examples of methods for the synthesis of molecular libraries can be 
found in the art, for example in: DeWitt et al. (1993) Proc Natl Acad Sci U.S.A. 
90:6909; Erb et al. (1994) Proc Natl Acad Sci U.S.A. 91:11422; Zuckermann et al. 
(1994) J Med Chem 37:2678; Cho et al. (1993) Science 261:1303; Carrell et al. (1994) 
Angew Chem Int Ed Engl 33:2059; Carrell et al. (1994) Angew Chem Int Ed Engl 
33:2061; and Gallop et al. (1994) J Med Chem 37:1233. 

[00319] Libraries of compounds may be presented in solution (e.g., Houghten 
(1992) Biotechniques 13:412-421), or on beads (Lam (1991) Nature 354:82-84), on 
chips (Fodor (1993) Nature 364:555-556), bacteria (Ladner U.S. Pat. No. 5,223,409), 
spores (Ladner USP '409), plasmids (Cull et al. (1992) Proc Natl Acad Sci USA 
89:1865-1869) or on phage (Scott and Smith (1990) Science 249:386-390; Devlin 
(1990) Science 249:404-406; Cwirla et al. (1990) Proc Natl Acad Sci USA 87:6378- 
6382; Felici (1991) J Mol Biol 222:301-310; Ladner above.). 

[00320] In one embodiment, an assay is a cell-based assay in which a cell that 

expresses a membrane-bound form of protein, or a biologically active portion thereof, 
on the cell surface is contacted with a test compound and the ability of the test 
compound to bind to a protein is determined. The cell, for example, can be of 
mammalian origin or a yeast cell. Determining the ability of the test compound to 
bind to the protein can be accomplished, for example, by coupling the test compound 
with a radioisotope or enzymatic label such that binding of the test compound to the 
protein or biologically active portion thereof can be determined by detecting the 
labeled compound in a complex. For example, test compounds can be labeled with 
125I, 35S, 14C, or 3H, either directly or indirectly, and the radioisotope detected by 
direct counting of radioemission or by scintillation counting. Alternatively, test 
compounds can be enzymatically labeled with, for example, horseradish peroxidase, 
alkaline phosphatase, or luciferase, and the enzymatic label detected by determination 
of conversion of an appropriate substrate to product. In one embodiment, the assay 
comprises contacting a cell which expresses a membrane-bound form of protein, or a 
biologically active portion thereof, on the cell surface with a known compound which 
binds the polypeptide of the present invention to form an assay mixture, contacting 
the assay mixture with a test compound, and determining the ability of the test 
compound to interact with a protein, wherein determining the ability of the test 
compound to interact with a protein comprises determining the ability of the test 
compound to preferentially bind to a polypeptide of the present invention or a 
biologically active portion thereof as compared to the known compound. 
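
Preferential binding relative to the known compound is often expressed as the percentage of known-compound binding that the test compound displaces. The sketch below assumes, beyond what is stated above, that the known compound carries the detectable label and that a nonspecific background reading is available. The function name `percent_displacement` and the counts are hypothetical.

```python
def percent_displacement(total_bound, bound_with_test, nonspecific=0.0):
    """Percent of specific known-compound binding displaced by a test compound.

    total_bound: label bound with the known compound alone.
    bound_with_test: label bound when the test compound is also present.
    nonspecific: background label not attributable to the protein.
    """
    specific = total_bound - nonspecific
    if specific <= 0:
        raise ValueError("no specific binding to displace")
    return 100.0 * (1.0 - (bound_with_test - nonspecific) / specific)


# Hypothetical counts: 1000 total, 550 with test compound, 100 background.
print(percent_displacement(1000.0, 550.0, nonspecific=100.0))
```

A higher percentage under these assumptions indicates that the test compound competes more effectively for the protein than the known compound does at the concentrations used.
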

[00321] In another embodiment, an assay is a cell-based assay comprising 
contacting a cell expressing a membrane-bound form of protein, or a biologically 
active portion thereof, on the cell surface with a test compound and determining the 
ability of the test compound to modulate (e.g., stimulate or inhibit) the activity of the 
protein or biologically active portion thereof. Determining the ability of the test 
compound to modulate the activity of the protein of the present invention or a 
biologically active portion thereof can be accomplished, for example, by determining 
the ability of the protein to bind to or interact with a target molecule. As used herein, 
a "target molecule" is a molecule with which a protein binds or interacts in nature, for 
example, a molecule on the surface of a cell which expresses a protein that interacts 
with a protein of the present invention, a molecule on the surface of a second cell, a 
molecule in the extracellular milieu, a molecule associated with the internal surface of 
a cell membrane or a cytoplasmic molecule. A target molecule can be a molecule
other than a protein of the present invention or a protein or polypeptide of the present
invention. In one embodiment, a target molecule is a component of a signal
transduction pathway that facilitates transduction of an extracellular signal (e.g., a
signal generated by binding of a compound to a membrane-bound molecule) through 
the cell membrane and into the cell. The target, for example, can be a second 
intercellular protein that has catalytic activity or a protein that facilitates the 
association of downstream signaling molecules with a protein of the present 
invention. 
[00322] Determining the ability of the protein to bind to or interact with a target molecule can
be accomplished by one of the methods described above for determining direct 
binding. In one embodiment, determining the ability of the protein to bind to or 
interact with a target molecule can be accomplished by determining the activity of the 
target molecule. For example, the activity of the target molecule can be determined by 
detecting induction of a cellular second messenger of the target (i.e., intracellular Ca2+,
diacylglycerol, IP3, etc.), detecting catalytic/enzymatic activity of the target on an
appropriate substrate, detecting the induction of a reporter gene (comprising a
regulatory element operatively linked to a nucleic acid encoding a detectable marker,
e.g., luciferase), or detecting a cellular response, for example, cell survival, cellular
differentiation, or cell proliferation. 

[00323] In yet another embodiment, an assay of the present invention is a cell- 
free assay comprising contacting a protein or biologically active portion thereof with a 
test compound and determining the ability of the test compound to bind to the protein 
or biologically active portion thereof. Binding of the test compound to the protein can 
be determined either directly or indirectly as described above. In one embodiment, the 
assay comprises contacting the protein or biologically active portion thereof with a 
known compound which binds a protein of the present invention to form an assay 
mixture, contacting the assay mixture with a test compound, and determining the 
ability of the test compound to interact with a protein, wherein determining the ability 
of the test compound to interact with a protein comprises determining the ability of 
the test compound to preferentially bind to a protein of the present invention or 
biologically active portion thereof as compared to the known compound. 
[00324] In another embodiment, an assay is a cell-free assay comprising
contacting a protein or biologically active portion thereof with a test compound and
determining the ability of the test compound to modulate (e.g., stimulate or inhibit) the
activity of the protein or biologically active portion thereof. Determining the ability of 
the test compound to modulate the activity of a protein of the present invention can be 
accomplished, for example, by determining the ability of the protein to bind to a target 
molecule by one of the methods described above for determining direct binding. In an 
alternative embodiment, determining the ability of the test compound to modulate the 
activity of a protein of the present invention can be accomplished by determining the 
ability of the protein to further modulate a target molecule. For example, the
catalytic/enzymatic activity of the target molecule on an appropriate substrate can be 
determined as previously described. 

[00325] In yet another embodiment, the cell-free assay comprises contacting 
the protein or biologically active portion thereof with a known compound which binds 
a protein of the present invention to form an assay mixture, contacting the assay 
mixture with a test compound, and determining the ability of the test compound to
interact with a protein, wherein determining the ability of the test compound to
interact with a protein comprises determining the ability of the protein to 
preferentially bind to or modulate the activity of a target molecule. 
[00326] The cell-free assays of the present invention are amenable to use of 
either the soluble form or the membrane-bound form of a protein of the present 
invention. In the case of cell-free assays comprising the membrane-bound form of a 
protein, it may be desirable to utilize a solubilizing agent such that the membrane- 
bound form of a protein of the present invention is maintained in solution. Examples 
of such solubilizing agents include non-ionic detergents such as n-octylglucoside,
n-dodecylglucoside, n-dodecylmaltoside, octanoyl-N-methylglucamide,
decanoyl-N-methylglucamide, Triton® X-100, Triton® X-114, Thesit®,
Isotridecypoly(ethylene glycol ether)n, N-dodecyl-N,N-dimethyl-3-ammonio-1-propane
sulfonate, 3-[(3-cholamidopropyl)dimethylamminio]-1-propane sulfonate (CHAPS), or
3-[(3-cholamidopropyl)dimethylamminio]-2-hydroxy-1-propane sulfonate (CHAPSO).
[00327] In more than one embodiment of the above assay methods of the
present invention, it may be desirable to immobilize either a protein of the present
invention or its target molecule to facilitate separation of complexed from 
uncomplexed forms of one or both of the proteins, as well as to accommodate 
automation of the assay. Binding of a test compound to a protein of the present 
invention, or interaction of a protein of the present invention with a target molecule in 
the presence and absence of a candidate compound, can be accomplished in any 
vessel suitable for containing the reactants. Examples of such vessels include 
microtiter plates, test tubes, and micro-centrifuge tubes. In one embodiment, a fusion 
protein can be provided that adds a domain that allows one or both of the proteins to 
be bound to a matrix. For example, GST-protein of the present invention fusion 
proteins or GST-target fusion proteins can be adsorbed onto glutathione sepharose 
beads (Sigma Chemical, St. Louis, MO) or glutathione derivatized microtiter plates, 
that are then combined with the test compound or the test compound and either the 
non-adsorbed target protein or protein, and the mixture is incubated under conditions 
conducive to complex formation (e.g., at physiological conditions for salt and pH).
Following incubation, the beads or microtiter plate wells are washed to remove any
unbound components, the matrix is immobilized in the case of beads, and the complex is
determined either directly or indirectly, for example, as described above.
Alternatively, the complexes can be dissociated from the matrix, and the level of a 
protein of the present invention binding or activity determined using standard 
techniques. 

[00328] Other techniques for immobilizing proteins on matrices can also be
used in the screening assays of the invention. For example, either a protein of the
present invention or its target molecule can be immobilized utilizing conjugation of
biotin and streptavidin. Biotinylated protein or target molecules can be prepared from
biotin-NHS (N-hydroxy-succinimide) using techniques well known in the art (e.g.,
biotinylation kit, Pierce Chemicals, Rockford, Ill.), and immobilized in the wells of
streptavidin-coated 96 well plates (Pierce Chemical). Alternatively, antibodies
reactive with a protein of the present invention or target molecules, but which do not 
interfere with binding of the protein to its target molecule, can be derivatized to the 
wells of the plate, and unbound target or protein trapped in the wells by antibody 
conjugation. Methods for detecting such complexes, in addition to those described 
above for the GST-immobilized complexes, include immunodetection of complexes 
using antibodies reactive with the protein or target molecule, as well as enzyme- 
linked assays that rely on detecting an enzymatic activity associated with the protein 
or target molecule. 

[00329] In another embodiment, modulators of expression of a protein of the
present invention are identified in a method wherein a cell is contacted with a
candidate compound and the expression of mRNA or protein in the cell is determined. 
The level of expression of mRNA or protein in the presence of the candidate 
compound is compared to the level of expression of mRNA or protein in the absence 
of the candidate compound. The candidate compound can then be identified as a 
modulator of expression of a protein of the present invention based on this comparison.
For example, when expression of mRNA or protein is greater (statistically 
significantly greater) in the presence of the candidate compound than in its absence, 
the candidate compound is identified as a stimulator of mRNA or protein expression. 
Alternatively, when expression of mRNA or protein is less (statistically significantly
less) in the presence of the candidate compound than in its absence, the candidate 
compound is identified as an inhibitor of mRNA or protein expression. The level of 
mRNA or protein expression in the cells can be determined by methods described
herein for detecting mRNA or protein. 
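
The comparison of expression in the presence and absence of a candidate compound can be sketched as follows. This is an illustrative example only: the specification requires a statistically significant difference but does not name a test, so the exact two-sided permutation test, the 0.05 threshold, the function names, and the replicate values used here are all assumptions.

```python
from itertools import combinations
from statistics import mean


def permutation_p_value(treated, control):
    """Exact two-sided permutation test on the difference of group means."""
    pooled = list(treated) + list(control)
    n_treated = len(treated)
    n_control = len(pooled) - n_treated
    observed = abs(mean(treated) - mean(control))
    total = sum(pooled)
    extreme = 0
    count = 0
    # Enumerate every way of relabeling the pooled values as "treated".
    for idx in combinations(range(len(pooled)), n_treated):
        group_sum = sum(pooled[i] for i in idx)
        diff = abs(group_sum / n_treated - (total - group_sum) / n_control)
        if diff >= observed - 1e-12:  # tolerance for floating-point rounding
            extreme += 1
        count += 1
    return extreme / count


def classify_candidate(treated, control, alpha=0.05):
    """Label a candidate compound from replicate expression measurements."""
    if permutation_p_value(treated, control) >= alpha:
        return "no significant effect"
    return "stimulator" if mean(treated) > mean(control) else "inhibitor"


if __name__ == "__main__":
    # Hypothetical normalized mRNA levels, four replicates per condition.
    with_compound = [2.1, 2.4, 2.2, 2.5]
    without_compound = [1.0, 1.1, 0.9, 1.2]
    print(classify_candidate(with_compound, without_compound))
```

With an exact permutation test, very small replicate numbers cannot reach the assumed 0.05 threshold (three replicates per group give a minimum p-value of 0.1), which is one reason a different test might be chosen in practice.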

[00330] In yet another aspect of the invention, a protein of the present
invention can be used as a "bait protein" in a two-hybrid assay or three-hybrid assay
(see, e.g., U.S. Pat. No. 5,283,317; Zervos et al. (1993) Cell 72:223-232; Madura et al.
(1993) J. Biol. Chem. 268:12046-12054; Bartel et al. (1993) Biotechniques 14:920-924;
Iwabuchi et al. (1993) Oncogene 8:1693-1696; and Brent WO 94/10300), to identify
other proteins that bind to or interact with a protein of the present invention and 
modulate activity of a protein of the present invention. Such binding proteins are also 
likely to be involved in the propagation of signals by the proteins as, for example, 
upstream or downstream elements of the pathway. 

[00331] The two-hybrid system is based on the modular nature of most
transcription factors, which consist of separable DNA-binding and activation
domains. Briefly, the assay utilizes two different DNA constructs. In one construct,
the gene that codes for a protein of the present invention is fused to a gene encoding
the DNA binding domain of a known transcription factor (e.g., GAL-4). In the other
construct, a DNA sequence, from a library of DNA sequences, that encodes an 
unidentified protein ("prey" or "sample") is fused to a gene that codes for the 
activation domain of the known transcription factor. If the "bait" and the "prey" 
proteins are able to interact, in vivo, forming a protein-dependent complex, the DNA- 
binding and activation domains of the transcription factor are brought into close 
proximity. This proximity allows transcription of a reporter gene (e.g., LacZ) that is
operably linked to a transcriptional regulatory site responsive to the transcription 
factor. Expression of the reporter gene can be detected and cell colonies containing 
the functional transcription factor can be isolated and used to obtain the cloned gene 
that encodes the protein, which interacts with a protein of the present invention. 

In vivo tumor models 

[00332] For cancer, a variety of well-known animal models can be used to 
further understand the role of the genes identified herein in the development and 
pathogenesis of tumors, and to test the efficacy of candidate therapeutic agents, 
including antibodies and other antagonists of the native protein of the present 
invention, such as small-molecule antagonists. The in vivo nature of such models 
makes them particularly predictive of responses in human patients. Animal models of
tumors and cancers (e.g., breast cancer, colon cancer, prostate cancer, lung cancer,
etc.) include both non-recombinant and recombinant (transgenic) animals.
Non-recombinant animal models include, for example, rodent, e.g., murine models. Such
models can be generated by introducing tumor cells which express the genes of the
present invention into syngeneic mice using standard techniques, e.g., subcutaneous
injection, tail vein injection, spleen implantation, intraperitoneal implantation,
implantation under the renal capsule, or orthotopic implantation, e.g., colon cancer
cells implanted in colonic tissue. See, e.g., PCT publication No. WO 97/33551,
published September 18, 1997. Probably the most often used animal species in 
oncological studies are immunodeficient mice and, in particular, nude mice. The 
observation that the nude mouse with thymic hypo/aplasia could successfully act as a 
host for human tumor xenografts (which naturally or through recombinant technology
express one or more of the genes of the present invention) has led to its widespread
use for this purpose. The autosomal recessive nu gene has been introduced into a very
large number of distinct congenic strains of nude mouse, including, for example,
ASW, A/He, AKR, BALB/c, B10.LP, C17, C3H, C57BL, C57, CBA, DBA, DDD,
I/st, NC, NFR, NFS, NFS/N, NZB, NZC, NZW, P, RIII, and SJL. In addition, a wide
variety of other animals with inherited immunological defects other than the nude 
mouse have been bred and used as recipients of tumor xenografts. For further details 
see, e.g., The Nude Mouse in Oncology Research, E. Boven and B. Winograd, eds.
(CRC Press, Inc., 1991). 

[00333] The cells introduced into such animals can be derived from known
tumor/cancer cell lines, for example, the B104-1-1 cell line (stable NIH-3T3 cell line
transfected with the neu protooncogene); ras-transfected NIH-3T3 cells; Caco-2
(ATCC HTB-37); or a moderately well-differentiated grade II human colon
adenocarcinoma cell line, HT-29 (ATCC HTB-38); or from tumors and cancers.
Samples of tumor or cancer cells can be obtained from patients undergoing surgery,
using standard conditions involving freezing and storing in liquid nitrogen (Karmali et
al., Br. J. Cancer, 48: 689-696 (1983)).

[00334] Tumor cells can be introduced into animals such as nude mice by a 
variety of procedures. The subcutaneous (s.c.) space in mice is very suitable for tumor 
implantation. Tumors can be transplanted s.c. as solid blocks, as needle biopsies by 
use of a trochar, or as cell suspensions. For solid-block or trochar implantation, tumor
tissue fragments of suitable size are introduced into the s.c. space. Cell suspensions 
are freshly prepared from primary tumors or stable tumor cell lines, and injected 
subcutaneously. Tumor cells can also be injected as subdermal implants. In this 
location, the inoculum is deposited between the lower part of the dermal connective 
tissue and the s.c. tissue.

[00335] Animal models of breast cancer can be generated, for example, by
implanting rat neuroblastoma cells (from which the neu oncogene was initially
isolated), or neu-transformed NIH-3T3 cells into nude mice, essentially as described
by Drebin et al., Proc. Natl. Acad. Sci. USA, 83: 9129-9133 (1986).
[00336] Similarly, animal models of colon cancer can be generated by
passaging colon cancer cells in animals, e.g., nude mice, leading to the appearance of
tumors in these animals. An orthotopic transplant model of human colon cancer in
nude mice has been described, for example, by Wang et al., Cancer Research, 54:
4726-4728 (1994) and Togo et al., Cancer Research, 55: 681-684 (1995). This model is
based on the so-called "METAMOUSE"™ sold by Anticancer, Inc., (San Diego,
California).

Syngeneic tumor models

[00337] Tumors that arise in animals can be removed and cultured in vitro.
Cells from the in vitro cultures can then be passaged to animals. Such tumors can
serve as targets for further testing or drug screening. Alternatively, the tumors 
resulting from the passage can be isolated and RNA from pre-passage cells and cells 
isolated after one or more rounds of passage analyzed for differential expression of 
genes of interest. Such passaging techniques can be performed with any known tumor 
or cancer cell lines. 

[00338] For example, Meth A, CMS4, CMS5, and WEHI-164 are chemically
induced fibrosarcomas of BALB/c female mice (DeLeo et al., J. Exp. Med., 146: 720
(1977)), which provide a highly controllable model system for studying the anti-tumor
activities of various agents (Palladino et al., J. Immunol., 138: 4023-4032 (1987)).
Briefly, tumor cells are propagated in vitro in cell culture. Prior to injection into the
animals, the cell lines are washed and suspended in buffer, at a cell density of about
10x10^6 to 10x10^7 cells/ml. The animals are then injected subcutaneously with 10 to
100 µl of the cell suspension, allowing one to three weeks for a tumor to appear.
[00339] In addition, the Lewis lung (3LL) carcinoma of mice, which is one of
the most thoroughly studied experimental tumors, can be used as an investigational
tumor model. Efficacy in this tumor model has been correlated with beneficial effects
in the treatment of human patients diagnosed with small-cell carcinoma of the lung
(SCCL). This tumor can be introduced in normal mice upon injection of tumor
fragments from an affected mouse or of cells maintained in culture (Zupi et al., Br. J.
Cancer, 41: suppl. 4, 30 (1980)). Evidence indicates that tumors can be started from
injection of even a single cell and that a very high proportion of injected tumor cells
survive. For further information about this tumor model see Zacharski, Haemostasis,
16: 300-320 (1986).

[00340] One way of evaluating the efficacy of a test compound in an animal
model with an implanted tumor is to measure the size of the tumor before and after
treatment. Traditionally, the size of implanted tumors has been measured with a slide 
caliper in two or three dimensions. The measure limited to two dimensions does not 
accurately reflect the size of the tumor; therefore, it is usually converted into the 
corresponding volume by using a mathematical formula. However, the measurement 
of tumor size is very inaccurate. The therapeutic effects of a drug candidate can be 
better described as treatment-induced growth delay and specific growth delay. 
Another important variable in the description of tumor growth is the tumor volume 
doubling time. Computer programs for the calculation and description of tumor 
growth are also available, such as the program reported by Rygaard and
Spang-Thomsen, Proc. 6th Int. Workshop on Immune-Deficient Animals, Wu and Sheng eds.
(Basel, 1989), p. 301. It is noted, however, that necrosis and inflammatory responses 
following treatment may actually result in an increase in tumor size, at least initially. 
Therefore, these changes need to be carefully monitored, by a combination of a 
morphometric method and flow cytometric analysis. 
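
The quantities named above (volume from caliper measurements, tumor volume doubling time, and growth delay) can be sketched in a few lines. The specification does not state which formulas are used, so the modified-ellipsoid volume formula, the assumption of exponential growth between two time points, and the definition of specific growth delay as (T - C)/C are common conventions adopted here as assumptions; all names and example values are hypothetical.

```python
import math


def tumor_volume(length_mm, width_mm, height_mm=None):
    """Estimate tumor volume in mm^3 from caliper measurements.

    Two dimensions: V = 0.5 * L * W^2 (assumed modified-ellipsoid formula).
    Three dimensions: V = (pi / 6) * L * W * H (ellipsoid).
    """
    if height_mm is None:
        return 0.5 * length_mm * width_mm ** 2
    return math.pi / 6.0 * length_mm * width_mm * height_mm


def doubling_time(volume_start, volume_end, days_elapsed):
    """Tumor volume doubling time, assuming exponential growth."""
    if volume_start <= 0 or volume_end <= volume_start:
        raise ValueError("doubling time is defined only for a growing tumor")
    return days_elapsed * math.log(2) / math.log(volume_end / volume_start)


def growth_delay(days_treated, days_control):
    """Extra days the treated group needs to reach a reference volume."""
    return days_treated - days_control


def specific_growth_delay(days_treated, days_control):
    """Growth delay normalized to the control time (assumed definition)."""
    return (days_treated - days_control) / days_control


if __name__ == "__main__":
    print(tumor_volume(10.0, 6.0))            # two-dimensional estimate
    print(doubling_time(100.0, 400.0, 10.0))  # two doublings in ten days
    print(specific_growth_delay(18.0, 12.0))
```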

[00341] Further, recombinant (transgenic) animal models can be engineered by 
introducing the coding portion of the genes identified herein into the genome of 
animals of interest, using standard techniques for producing transgenic animals. 
Animals that can serve as a target for transgenic manipulation include, without 
limitation, mice, rats, rabbits, guinea pigs, sheep, goats, pigs, and non-human
primates, e.g., baboons, chimpanzees, and monkeys. Techniques known in the art to
introduce a transgene into such animals include pronuclear microinjection (U.S.
Patent No. 4,873,191); retrovirus-mediated gene transfer into germ lines (e.g., Van
der Putten et al., Proc. Natl. Acad. Sci. USA, 82: 6148-6152 (1985)); gene targeting in
embryonic stem cells (Thompson et al., Cell, 56: 313-321 (1989)); electroporation of
embryos (Lo, Mol. Cell. Biol., 3: 1803-1814 (1983)); and sperm-mediated gene
transfer (Lavitrano et al., Cell, 57: 717-723 (1989)). For a review, see, for example,
U.S. Patent No. 4,736,866. 

[00342] For the purpose of the present invention, transgenic animals include
those that carry the transgene only in part of their cells ("mosaic animals"). The
transgene can be integrated either as a single transgene, or in concatamers, e.g.,
head-to-head or head-to-tail tandems. Selective introduction of a transgene into a particular
cell type is also possible by following, for example, the technique of Lasko et al.,
Proc. Natl. Acad. Sci. USA, 89: 6232-6236 (1992).

[00343] The expression of the transgene in transgenic animals can be
monitored by standard techniques. For example, Southern blot analysis or PCR
amplification can be used to verify the integration of the transgene. The level of
mRNA expression can then be analyzed using techniques such as in situ
hybridization, Northern blot analysis, PCR, or immunocytochemistry. The animals are
further examined for signs of tumor or cancer development. 
[00344] Alternatively, "knock-out" animals can be constructed that have a
defective or altered gene encoding a polypeptide identified herein, as a result of
homologous recombination between the endogenous gene encoding the polypeptide 
and altered genomic DNA encoding the same polypeptide introduced into an 
embryonic cell of the animal. For example, cDNA encoding a particular polypeptide 
can be used to clone genomic DNA encoding that polypeptide in accordance with 
established techniques. A portion of the genomic DNA encoding a particular 
polypeptide can be deleted or replaced with another gene, such as a gene encoding a 
selectable marker that can be used to monitor integration. Typically, several kilobases
of unaltered flanking DNA (both at the 5' and 3' ends) are included in the vector. See,
e.g., Thomas and Capecchi, Cell, 51: 503 (1987) for a description of homologous 
recombination vectors. The vector is introduced into an embryonic stem cell line (e.g., 
by electroporation) and cells in which the introduced DNA has homologously 
recombined with the endogenous DNA are selected. See, e.g., Li et al., Cell, 69: 915
(1992). The selected cells are then injected into a blastocyst of an animal (e.g., a 
mouse or rat) to form aggregation chimeras. See, e.g., Bradley, in Teratocarcinomas 
and Embryonic Stem Cells: A Practical Approach, E. J. Robertson, ed. (IRL: Oxford, 
1987), pp. 113-152. A chimeric embryo can then be implanted into a suitable
pseudopregnant female foster animal and the embryo brought to term to create a 
"knock-out" animal. Progeny harboring the homologously recombined DNA in their 
germ cells can be identified by standard techniques and used to breed animals in 
which all cells of the animal contain the homologously recombined DNA. Knock-out 
animals can also be generated, as is well known in the art, by administering an 
antisense molecule of the invention. Animals comprising such antisense molecules are 
specifically contemplated as an embodiment of the invention. Knockout animals can 
be characterized, for instance, by their ability to defend against certain pathological 
conditions and by their development of pathological conditions due to absence 
(knock-out) of the polypeptides. 

Animal models (non-rodent)

[00345] The efficacy of antibodies specifically binding the polypeptides of the
present invention identified herein, and other drug candidates, can also be tested
in the treatment of spontaneous animal tumors. A suitable target for such studies is the 
feline oral squamous cell carcinoma (SCC). Feline oral SCC is a highly invasive, 
malignant tumor that is the most common oral malignancy of cats, accounting for 
over 60% of the oral tumors reported in this species. It rarely metastasizes to distant 
sites, although this low incidence of metastasis may merely be a reflection of the short 
survival times for cats with this tumor. These tumors are usually not amenable to 
surgery, primarily because of the anatomy of the feline oral cavity. At present, there is 
no effective treatment for this tumor. Prior to entry into the study, each cat undergoes 
complete clinical examination and biopsy, and is scanned by computed tomography 
(CT). Cats diagnosed with sublingual oral squamous cell tumors are excluded from 
the study. The tongue can become paralyzed as a result of such tumor, and even if the 
treatment kills the tumor, the animals may not be able to feed themselves. Each cat is 
treated repeatedly, over a longer period of time. Photographs of the tumors will be 
taken daily during the treatment period, and at each subsequent recheck. After 
treatment, each cat undergoes another CT scan. CT scans and thoracic radiograms are
evaluated every 8 weeks thereafter. The data are evaluated for differences in survival, 
response, and toxicity as compared to control groups. Positive response may require 
evidence of tumor regression, preferably with improvement of quality of life and/or 
increased life span. 

[00346] In addition, other spontaneous animal tumors, such as fibrosarcoma, 
adenocarcinoma, lymphoma, chondroma, or leiomyosarcoma of dogs, cats, and 
baboons can also be tested. Of these, mammary adenocarcinoma in dogs and cats is a 
preferred model as its appearance and behavior are very similar to those in humans. 
However, the use of this model is limited by the rare occurrence of this type of tumor 
in animals. 

[00347] Other in vitro and in vivo cancer tests known in the art are also suitable
herein.

Screening for agonists or antagonists 

[00348] This invention encompasses methods of screening compounds to
identify those that mimic the polypeptide (agonists) or prevent the effect of the
polypeptide (antagonists). 

[00349] Screening assays for antagonist drug candidates are designed to
identify compounds that bind or complex with the polypeptides encoded by the genes
identified herein, or otherwise interfere with the interaction of the encoded
polypeptides with other cellular proteins. Such assays include methods identifying 
compounds that interfere with the interaction of a gene (mRNA or genomic DNA) 
encoding a polypeptide, such as those described herein. These screening assays will 
include assays amenable to high- or ultra-high-throughput screening of chemical 
libraries, making them particularly suitable for identifying antisense and small 
molecule drug candidates. 

[00350] The assays can be performed in a variety of formats, including protein- 
protein binding assays, biochemical screening assays, immunoassays, target nucleic 
acid binding assays, and cell-based assays, which are well characterized in the art. 

Antisense

[00351] Another potential polypeptide antagonist is an antisense construct
prepared using antisense technology, where, for example, the antisense molecule acts
to block directly the translation of mRNA (or transcription) by hybridizing to targeted 
mRNA (or genomic DNA) and preventing protein translation (or mRNA 
transcription) of a protein of the present invention. Antisense technology can be used 
to control gene expression through triple-helix formation or antisense DNA or RNA, 
both of which methods are based on binding of a polynucleotide to DNA or RNA. For 
example, the 5' coding portion of the polynucleotide sequence, which encodes the
mature polypeptides herein, is used to design an antisense RNA oligonucleotide of 
from about 10 to 40 base pairs in length. A DNA oligonucleotide is designed to be 
complementary to a region of the gene involved in transcription (triple helix; see Lee
et al., Nucl. Acids Res., 6: 3073 (1979); Cooney et al., Science, 241: 456 (1988);
Dervan et al., Science, 251: 1360 (1991)), thereby preventing transcription and the
production of the polypeptide. The antisense RNA oligonucleotide hybridizes to the
mRNA in vivo and blocks translation of the mRNA molecule into the polypeptide
(antisense; Okano, Neurochem., 56: 560 (1991); Oligodeoxynucleotides as Antisense
Inhibitors of Gene Expression (CRC Press: Boca Raton, FL, 1988)). The
oligonucleotides described above can also be delivered to cells such that the antisense 
RNA or DNA can be expressed in vivo to inhibit production of the polypeptide. When 
antisense DNA is used, oligodeoxyribonucleotides derived from the translation 
initiation site, e.g., between about -10 and +10 positions of the target gene nucleotide
sequence, are preferred. 
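
Deriving an antisense oligodeoxyribonucleotide from the region around the translation initiation site can be sketched as follows. This is a minimal illustration, assuming the sense-strand DNA sequence and the index of the ATG start codon are already known, and assuming the common numbering in which the A of the ATG is position +1 and there is no position 0; the function names and the toy sequence are hypothetical and do not correspond to any sequence of the sequence listing.

```python
COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A"}


def reverse_complement(dna):
    """Return the reverse complement of a DNA sequence (A, C, G, T only)."""
    return "".join(COMPLEMENT[base] for base in reversed(dna.upper()))


def antisense_around_start(sense_dna, atg_index, upstream=10, downstream=10):
    """Antisense DNA oligo spanning positions -upstream to +downstream.

    atg_index is the 0-based index of the A of the ATG start codon in the
    sense strand. The returned oligo is written 5' to 3' and is
    complementary to the sense strand (and hence to the mRNA) in that window.
    """
    sense_dna = sense_dna.upper()
    if sense_dna[atg_index:atg_index + 3] != "ATG":
        raise ValueError("atg_index does not point at an ATG codon")
    start = atg_index - upstream
    end = atg_index + downstream
    if start < 0 or end > len(sense_dna):
        raise ValueError("sequence is too short for the requested window")
    return reverse_complement(sense_dna[start:end])


if __name__ == "__main__":
    # Toy sense-strand sequence, for illustration only.
    toy_sense = "GGCGCCGCCACC" + "ATGGCTAGCAAA"
    print(antisense_around_start(toy_sense, atg_index=12))
```

A real design would additionally screen candidate oligonucleotides for length, secondary structure, and unintended complementarity to other transcripts, none of which is attempted in this sketch.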

[00352] Antisense RNA or DNA molecules are generally at least about 5 bases
in length, about 10 bases in length, about 15 bases in length, about 20 bases in length,
about 25 bases in length, about 30 bases in length, about 35 bases in length, about 40 
bases in length, about 45 bases in length, about 50 bases in length, about 55 bases in 
length, about 60 bases in length, about 65 bases in length, about 70 bases in length, 
about 75 bases in length, about 80 bases in length, about 85 bases in length, about 90 
bases in length, about 95 bases in length, about 100 bases in length, or more. 
[00353] The present invention employs oligomeric antisense compounds, 
particularly oligonucleotides, for use in modulating the function of nucleic acid 
molecules including SEQ ID NOs:1-11, SEQ ID NOs:13-38, and SEQ ID NO:3113
encoding SEQ ID NOs:39-49, SEQ ID NOs:51-76, and SEQ ID NO:3114, ultimately
modulating the amount of SEQ ID NOs:39-49, SEQ ID NOs:51-76, and SEQ ID
NO:3114 produced. This is accomplished by providing antisense compounds, which
specifically hybridize with one or more nucleic acids including SEQ ID NOs:1-11,
SEQ ID NOs:13-38, and SEQ ID NO:3113 encoding SEQ ID NOs:39-49, SEQ ID
NOs:51-76, and SEQ ID NO:3114. As used herein, the terms "target nucleic acid"
and "nucleic acid including SEQ ID NOs:1-11, SEQ ID NOs:13-38, and SEQ ID
NO:3113 encoding SEQ ID NOs:39-49, SEQ ID NOs:51-76, and SEQ ID NO:3114"
encompass DNA encoding SEQ ID NOs:39-49, SEQ ID NOs:51-76, and SEQ ID
NO:3114, RNA (including pre-mRNA and mRNA) transcribed from such DNA, and also
cDNA derived from such RNA. The specific hybridization of an oligomeric 
compound with its target nucleic acid interferes with the normal function of the 
nucleic acid. This modulation of function of a target nucleic acid by compounds, 
which specifically hybridize to it, is generally referred to as "antisense". The functions 
of DNA to be interfered with include replication and transcription. The functions of 
RNA to be interfered with include all vital functions such as, for example, 
translocation of the RNA to the site of protein translation, translation of protein from 
the RNA, splicing of the RNA to yield one or more mRNA species, and catalytic 
activity which may be engaged in or facilitated by the RNA. The overall effect of 
such interference with target nucleic acid function is modulation of the expression of 
SEQ ID NOs:39-49, SEQ ID NOs:51-76, and SEQ ID NO:3114. In the context of the
present invention, "modulation" means either an increase (stimulation) or a decrease 
(inhibition) in the expression of a gene. In the context of the present invention, 
inhibition is the preferred form of modulation, of gene expression and mRNA is a 
preferred target. 

[00354] It is preferred to target specific nucleic acids for antisense. "Targeting" 
an antisense compound to a particular nucleic acid, in the context of this invention, is 
a multistep process. The process usually begins with the identification of a nucleic 
acid sequence whose function is to be modulated. This may be, for example, a cellular 
gene (or mRNA transcribed from the gene) whose expression is associated with a 
particular disorder or disease state, or a nucleic acid molecule from an infectious 
agent. In the present invention, the target is a nucleic acid molecule including SEQ ID 
NOs:1-11, SEQ ID NOs:13-38, and SEQ ID NO:3113 encoding SEQ ID NOs:39-49, 
SEQ ID NOs:51-76, and SEQ ID NO:3114. The targeting process also includes 
determination of a site or sites within this gene for the antisense interaction to occur 
such that the desired effect, e.g., detection or modulation of expression of the protein, 
will result. Within the context of the present invention, a preferred intragenic site is 
the region encompassing the translation initiation or termination codon of the open 
reading frame (ORF) of the gene. Since, as is known in the art, the translation 
initiation codon is typically 5'-AUG (in transcribed mRNA molecules; 5'-ATG in the 
corresponding DNA molecule), the translation initiation codon is also referred to as 
the "AUG codon," the "start codon" or the "AUG start codon". A minority of genes 
have a translation initiation codon having the RNA sequence 5'-GUG, 5'-UUG or 
5'-CUG, and 5'-AUA, 5'-ACG and 5'-CUG have been shown to function in vivo. Thus, 
the terms "translation initiation codon" and "start codon" can encompass many codon 
sequences, even though the initiator amino acid in each instance is typically 
methionine (in eukaryotes) or formylmethionine (in prokaryotes). It is also known in 
the art that eukaryotic and prokaryotic genes may have two or more alternative start 
codons, any one of which may be preferentially utilized for translation initiation in a 
particular cell type or tissue, or under a particular set of conditions. In the context of 
the invention, "start codon" and "translation initiation codon" refer to the codon or 
codons that are used in vivo to initiate translation of an mRNA molecule transcribed 
from a gene encoding SEQ ID NOs:39-49, SEQ ID NOs:51-76, and SEQ ID NO:3114, 
regardless of the sequence(s) of such codons. 

[00355] It is also known in the art that a translation termination codon (or "stop 
codon") of a gene may have one of three sequences, i.e., 5'-UAA, 5'-UAG and 
5'-UGA (the corresponding DNA sequences are 5'-TAA, 5'-TAG and 5'-TGA, 
respectively). The terms "start codon region" and "translation initiation codon 
region" refer to a portion of such an mRNA or gene that encompasses from about 25 to 
about 50 contiguous nucleotides in either direction (i.e., 5' or 3') from a translation 
initiation codon. Similarly, the terms "stop codon region" and "translation termination 
codon region" refer to a portion of such an mRNA or gene that encompasses from 
about 25 to about 50 contiguous nucleotides in either direction (i.e., 5' or 3') from a 
translation termination codon. 
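
By way of illustration only, the region definitions above can be sketched in Python. The function name `codon_region`, the toy transcript, and the choice to return the codon together with its flanks are assumptions made for this sketch; they are not part of the specification. The flank is restricted to the range of about 25 to about 50 nucleotides recited above and is clipped at the ends of the transcript.

```python
def codon_region(mrna: str, codon_start: int, flank: int = 25) -> str:
    """Return the codon at codon_start plus up to `flank` nt on each side."""
    if not 25 <= flank <= 50:
        raise ValueError("flank is expected to be from about 25 to about 50 nt")
    if codon_start < 0 or codon_start + 3 > len(mrna):
        raise ValueError("codon_start does not index a complete codon")
    left = max(0, codon_start - flank)                # clip at the 5' end
    right = min(len(mrna), codon_start + 3 + flank)   # clip at the 3' end
    return mrna[left:right]


# Hypothetical toy transcript: 40 nt of 5' UTR, an AUG start codon,
# then 60 nt standing in for the remainder of the message.
toy_mrna = "G" * 40 + "AUG" + "C" * 60
start_region = codon_region(toy_mrna, toy_mrna.find("AUG"), flank=25)
```

The same function may be applied to a translation termination codon by passing the index of the stop codon instead of the start codon.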

[00356] The open reading frame (ORF) or "coding region," which is known in 
the art to refer to the region between the translation initiation codon and the 
translation termination codon, is also a region which may be targeted effectively. 
Other target regions include the 5' untranslated region (5'UTR), known in the art to 
refer to the portion of an mRNA in the 5' direction from the translation initiation 
codon, and thus including nucleotides between the 5' cap site and the translation 
initiation codon of an mRNA or corresponding nucleotides on the gene, and the 3' 
untranslated region (3'UTR), known in the art to refer to the portion of an mRNA in 
the 3' direction from the translation termination codon, and thus including nucleotides 
between the translation termination codon and 3' end of an mRNA or corresponding 
nucleotides on the gene. The 5' cap of an mRNA comprises an N7-methylated 
guanosine residue joined to the 5'-most residue of the mRNA via a 5'-5' triphosphate 
linkage. The 5' cap region of an mRNA is considered to include the 5' cap structure 
itself as well as the first 50 nucleotides adjacent to the cap. The 5' cap region may 
also be a preferred target region. 

[00357] Although some eukaryotic mRNA transcripts are directly translated, 
many contain one or more regions, known as "introns," which are excised from a 
transcript before it is translated. The remaining (and therefore translated) regions are 
known as "exons" and are spliced together to form a continuous mRNA sequence. 
mRNA splice sites, i.e., intron-exon junctions, may also be preferred target regions, 
and are particularly useful in situations where aberrant splicing is implicated in 
disease, or where an overproduction of a particular mRNA splice product is 
implicated in disease. Aberrant fusion junctions due to rearrangements or deletions 
are also preferred targets. Introns have also been found to be effective, and 
therefore preferred, target regions for antisense compounds targeted, for example, to 
DNA or pre-mRNA. 

[00358] Once one or more target sites have been identified, oligonucleotides 
are chosen which are sufficiently complementary to the target, i.e., hybridize 
sufficiently well and with sufficient specificity, to give the desired effect. 

[00359] In the context of this invention, "hybridization" means hydrogen 
bonding, which may be Watson-Crick, Hoogsteen, or reversed Hoogsteen hydrogen 
bonding, between complementary nucleoside or nucleotide bases. For example, 
adenine and thymine are complementary nucleobases, which pair through the 
formation of hydrogen bonds. "Complementary," as used herein, refers to the capacity 
for precise pairing between two nucleotides. For example, if a nucleotide at a certain 
position of an oligonucleotide is capable of hydrogen bonding with a nucleotide at the 
same position of a DNA or RNA molecule, then the oligonucleotide and the DNA or 
RNA are considered to be complementary to each other at that position. The 
oligonucleotide and the DNA or RNA are complementary to each other when a 
sufficient number of corresponding positions in each molecule are occupied by 
nucleotides which can hydrogen bond with each other. Thus, "specifically 
hybridizable" and "complementary" are terms which are used to indicate a sufficient 
degree of complementarity or precise pairing such that stable and specific binding 
occurs between the oligonucleotide and the DNA or RNA target. It is understood in 
the art that the sequence of an antisense compound need not be 100% complementary 
to that of its target nucleic acid to be specifically hybridizable. An antisense 
compound is specifically hybridizable when binding of the compound to the target 
DNA or RNA molecule interferes with the normal function of the target DNA or 
RNA to cause a loss of utility, and there is a sufficient degree of complementarity to 
avoid non-specific binding of the antisense compound to non-target sequences under 
conditions in which specific binding is desired, i.e., under physiological conditions in 
the case of in vivo assays or therapeutic treatment, and in the case of in vitro assays, 
under conditions in which the assays are performed. 
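
The notion of position-by-position complementarity described above can be sketched as follows. This is a minimal illustration under stated assumptions: only Watson-Crick pairs between a DNA oligonucleotide and an RNA target are counted, both strands are written 5' to 3', and the function name `fraction_complementary` is hypothetical. The resulting fraction does not by itself establish that a compound is specifically hybridizable, since that also depends on the binding conditions discussed above.

```python
# Watson-Crick pairs as (DNA oligo base, RNA target base). Assumption: Hoogsteen
# and reversed Hoogsteen pairing are ignored in this sketch.
PAIRS = {("A", "U"), ("T", "A"), ("G", "C"), ("C", "G")}


def fraction_complementary(oligo_5to3: str, target_site_5to3: str) -> float:
    """Fraction of positions at which a DNA oligo pairs with an RNA site.

    Both strings are written 5'->3'. The oligo binds antiparallel, so its
    first base is compared with the last base of the target site.
    """
    if len(oligo_5to3) != len(target_site_5to3):
        raise ValueError("oligo and target site must have the same length")
    if not oligo_5to3:
        raise ValueError("sequences must be non-empty")
    paired = sum(
        (o, t) in PAIRS
        for o, t in zip(oligo_5to3.upper(), reversed(target_site_5to3.upper()))
    )
    return paired / len(oligo_5to3)


perfect = fraction_complementary("GGCCAT", "AUGGCC")       # every position pairs
one_mismatch = fraction_complementary("GGCCAA", "AUGGCC")  # 3'-terminal mismatch
```

A value below 1.0, as for the second oligonucleotide, corresponds to a compound that is less than 100% complementary to its target.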

[00360] Antisense compounds are commonly used as research reagents and 
diagnostics. For example, antisense oligonucleotides, which are able to inhibit gene 
expression with exquisite specificity, are often used by those of ordinary skill to 
elucidate the function of particular genes. Antisense compounds are also used, for 
example, to distinguish between functions of various members of a biological 
pathway. Antisense modulation has, therefore, been harnessed for research use. 

[00361] The specificity and sensitivity of antisense are also harnessed by those 
of skill in the art for therapeutic uses. Antisense oligonucleotides have been employed 
as therapeutic moieties in the treatment of disease states in animals and man. 
Antisense oligonucleotides have been safely and effectively administered to humans 
and numerous clinical trials are presently underway. It is thus established that 
oligonucleotides can be useful therapeutic modalities that can be configured for 
use in treatment regimes for cells, tissues and animals, especially humans. In the 
context of this invention, the term "oligonucleotide" refers to an oligomer or 
polymer of ribonucleic acid (RNA) or deoxyribonucleic acid (DNA) or 
mimetics thereof. This term includes oligonucleotides composed of naturally 
occurring nucleobases, sugars and covalent internucleoside (backbone) linkages as 
well as oligonucleotides having non-naturally occurring portions which function 
similarly. Such modified or substituted oligonucleotides are often preferred over 
native forms because of desirable properties such as, for example, enhanced cellular 
uptake, enhanced affinity for nucleic acid target and increased stability in the presence 
of nucleases. 

[00362] While antisense oligonucleotides are a preferred form of antisense 
compound, the present invention comprehends other oligomeric antisense compounds, 
including but not limited to oligonucleotide mimetics such as are described below. 
The antisense compounds in accordance with this invention preferably comprise from 
about 8 to about 30 nucleobases (i.e., from about 8 to about 30 linked nucleosides). 
Particularly preferred antisense compounds are antisense oligonucleotides, even more 
preferably those comprising from about 12 to about 25 nucleobases. As is known in 
the art, a nucleoside is a base-sugar combination. The base portion of the nucleoside is 
normally a heterocyclic base. The two most common classes of such heterocyclic 
bases are the purines and the pyrimidines. Nucleotides are nucleosides that further 
include a phosphate group covalently linked to the sugar portion of the nucleoside. 
For those nucleosides that include a pentofuranosyl sugar, the phosphate group can be 
linked to either the 2', 3' or 5' hydroxyl moiety of the sugar. In forming 
oligonucleotides, the phosphate groups covalently link adjacent nucleosides to one 
another to form a linear polymeric compound. In turn, the respective ends of this linear 
polymeric structure can be further joined to form a circular structure; however, open 
linear structures are generally preferred. Within the oligonucleotide structure, the 
phosphate groups are commonly referred to as forming the internucleoside backbone 
of the oligonucleotide. The normal linkage or backbone of RNA and DNA is a 3' to 
5' phosphodiester linkage. 
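
As a hedged illustration of the preferred length range, the following Python sketch enumerates every candidate antisense oligonucleotide of about 12 to about 25 nucleobases against a target region. The function name `antisense_candidates`, the mapping table, and the choice to emit DNA oligonucleotides are assumptions of this sketch rather than a method recited herein; it performs no ranking or specificity screening.

```python
# Complement of each target base, expressed as a DNA oligo base.
# "T" is included so that a cDNA-style target string is also accepted.
COMPLEMENT = {"A": "T", "U": "A", "T": "A", "G": "C", "C": "G"}


def antisense_candidates(target: str, min_len: int = 12, max_len: int = 25):
    """Yield (start, length, oligo) for every window of the target.

    Each oligo is the DNA reverse complement of the window, written 5'->3'.
    Characters other than A, C, G, U and T raise KeyError.
    """
    target = target.upper()
    for length in range(min_len, max_len + 1):
        for start in range(len(target) - length + 1):
            window = target[start:start + length]
            oligo = "".join(COMPLEMENT[base] for base in reversed(window))
            yield start, length, oligo


# Hypothetical 12 nt target site, written 5'->3' as RNA.
only_candidate = list(antisense_candidates("AUGGCCAUGGCC", 12, 12))
```

Widening the bounds to 8 and 30 would cover the broader range of about 8 to about 30 nucleobases stated above.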

[00363] Specific examples of preferred antisense compounds useful in this 
invention include oligonucleotides containing modified backbones or non-natural 
internucleoside linkages. As defined in this specification, oligonucleotides having 
modified backbones include those that retain a phosphorus atom in the backbone and 
those that do not have a phosphorus atom in the backbone. For the purposes of this 
specification, and as sometimes referenced in the art, modified oligonucleotides that 
do not have a phosphorus atom in their internucleoside backbone can also be 
considered to be oligonucleosides. 

[00364] Preferred modified oligonucleotide backbones include, for example, 
phosphorothioates, chiral phosphorothioates, phosphorodithioates, phosphotriesters, 
aminoalkylphosphotriesters, methyl and other alkyl phosphonates including 
3'-alkylene phosphonates and chiral phosphonates, phosphinates, phosphoramidates 
including 3'-amino phosphoramidate and aminoalkylphosphoramidates, 
thionophosphoramidates, thionoalkylphosphonates, thionoalkylphosphotriesters, and 
boranophosphates having normal 3'-5' linkages, 2'-5' linked analogs of these, and 
those having inverted polarity wherein the adjacent pairs of nucleoside units are 
linked 3'-5' to 5'-3' or 2'-5' to 5'-2'. Various salts, mixed salts and free acid forms 
are also included. 

[00365] Representative United States patents that teach the preparation of the 
above phosphorus-containing linkages include, but are not limited to, U.S. 3,687,808; 
4,469,863; 4,476,301; 5,023,243; 5,177,196; 5,188,897; 5,264,423; 5,276,019; 
5,278,302; 5,286,717; 5,321,131; 5,399,676; 5,405,939; 5,453,496; 5,455,233; 
5,466,677; 5,476,925; 5,519,126; 5,536,821; 5,541,306; 5,550,111; 5,563,253; 
5,571,799; 5,587,361; and 5,625,050, each of which is herein incorporated by 
reference. 

[00366] Preferred modified oligonucleotide backbones that do not include a 
phosphorus atom therein have backbones that are formed by short chain alkyl or 
cycloalkyl internucleoside linkages, mixed heteroatom and alkyl or cycloalkyl 
internucleoside linkages, or one or more short chain heteroatomic or heterocyclic 
internucleoside linkages. These include those having morpholino linkages (formed in 
part from the sugar portion of a nucleoside); siloxane backbones; sulfide, sulfoxide 
and sulfone backbones; formacetyl and thioformacetyl backbones; methylene 
formacetyl and thioformacetyl backbones; alkene containing backbones; sulfamate 
backbones; methyleneimino and methylenehydrazino backbones; sulfonate and 
sulfonamide backbones; amide backbones; and others having mixed N, O, S and CH2 
component parts. 

[00367] Representative United States patents that teach the preparation of the 
above oligonucleosides include, but are not limited to, U.S. 5,034,506; 5,166,315; 
5,185,444; 5,214,134; 5,216,141; 5,235,033; 5,264,562; 5,264,564; 5,405,938; 
5,434,257; 5,466,677; 5,470,967; 5,489,677; 5,541,307; 5,561,225; 5,596,086; 
5,602,240; 5,610,289; 5,602,240; 5,608,046; 5,610,289; 5,618,704; 5,623,070; 
5,663,312; 5,633,360; 5,677,437; and 5,677,439, each of which is herein incorporated 
by reference. 

[00368] In other preferred oligonucleotide mimetics, both the sugar and the 
internucleoside linkage, i.e., the backbone, of the nucleotide units are replaced with 
novel groups. The base units are maintained for hybridization with an appropriate 
nucleic acid target compound. One such oligomeric compound, an oligonucleotide 
mimetic that has been shown to have excellent hybridization properties, is referred to 
as a peptide nucleic acid (PNA). In PNA compounds, the sugar-backbone of an 
oligonucleotide is replaced with an amide containing backbone, in particular an 
aminoethylglycine backbone. The nucleobases are retained and are bound directly or 
indirectly to aza nitrogen atoms of the amide portion of the backbone. Representative 
United States patents that teach the preparation of PNA compounds include, but are 
not limited to, U.S. 5,539,082; 5,714,331; and 5,719,262, each of which is herein 
incorporated by reference. Further teaching of PNA compounds can be found in 
Nielsen et al., Science, 1991, 254, 1497-1500. 

[00369] Most preferred embodiments of the invention are oligonucleotides with 
phosphorothioate backbones and oligonucleosides with heteroatom backbones, and in 
particular -CH2-NH-O-CH2-, -CH2-N(CH3)-O-CH2- [known as a methylene 
(methylimino) or MMI backbone], -CH2-O-N(CH3)-CH2-, -CH2-N(CH3)-N(CH3)-CH2- 
and -O-N(CH3)-CH2-CH2- [wherein the native phosphodiester backbone is 
represented as -O-P-O-CH2-] of the above referenced U.S. patent 5,489,677, and the 
amide backbones of the above referenced U.S. patent 5,602,240. Also preferred are 
oligonucleotides having morpholino backbone structures of the above-referenced U.S. 
patent 5,034,506. 

[00370] Modified oligonucleotides may also contain one or more substituted 
sugar moieties. Preferred oligonucleotides comprise one of the following at the 2' 
position: OH; F; O-, S-, or N-alkyl; O-, S-, or N-alkenyl; O-, S- or N-alkynyl; or 
O-alkyl-O-alkyl, wherein the alkyl, alkenyl and alkynyl may be substituted or 
unsubstituted C1 to C10 alkyl or C2 to C10 alkenyl and alkynyl. Particularly preferred 
are O[(CH2)nO]mCH3, O(CH2)nOCH3, O(CH2)nNH2, O(CH2)nCH3, O(CH2)nONH2, 
and O(CH2)nON[(CH2)nCH3]2, where n and m are from 1 to about 10. Other preferred 
oligonucleotides comprise one of the following at the 2' position: C1 to C10 lower 
alkyl, substituted lower alkyl, alkaryl, aralkyl, O-alkaryl or O-aralkyl, SH, SCH3, 
OCN, Cl, Br, CN, CF3, OCF3, SOCH3, SO2CH3, ONO2, NO2, N3, NH2, 
heterocycloalkyl, heterocycloalkaryl, aminoalkylamino, polyalkylamino, substituted 
silyl, an RNA cleaving group, a reporter group, an intercalator, a group for improving 
the pharmacokinetic properties of an oligonucleotide, or a group for improving the 
pharmacodynamic properties of an oligonucleotide, and other substituents having 
similar properties. A preferred modification includes 2'-methoxyethoxy 
(2'-O-CH2CH2OCH3, also known as 2'-O-(2-methoxyethyl) or 2'-MOE) (Martin et al., 
Helv. Chim. Acta, 1995, 78, 486-504), i.e., an alkoxyalkoxy group. A further preferred 
modification includes 2'-dimethylaminooxyethoxy, i.e., a O(CH2)2ON(CH3)2 group, 
also known as 2'-DMAOE, as described in examples herein below, and 
2'-dimethylaminoethoxyethoxy (also known in the art as 
2'-O-dimethylaminoethoxyethyl or 2'-DMAEOE), i.e., 2'-O-CH2-O-CH2-N(CH2)2, also 
described in examples herein below. 

[00371] Other preferred modifications include 2'-methoxy (2'-O-CH3), 
2'-aminopropoxy (2'-OCH2CH2CH2NH2) and 2'-fluoro (2'-F). Similar modifications 
may also be made at other positions on the oligonucleotide, particularly the 3' 
position of the sugar on the 3' terminal nucleotide or in 2'-5' linked oligonucleotides 
and the 5' position of 5' terminal nucleotide. Oligonucleotides may also have sugar 
mimetics such as cyclobutyl moieties in place of the pentofuranosyl sugar. 
Representative United States patents that teach the preparation of such modified sugar 
structures include, but are not limited to, U.S. 4,981,957; 5,118,800; 5,319,080; 
5,359,044; 5,393,878; 5,446,137; 5,466,786; 5,514,785; 5,519,134; 5,567,811; 
5,576,427; 5,591,722; 5,597,909; 5,610,300; 5,627,053; 5,639,873; 5,646,265; 
5,658,873; 5,670,633; and 5,700,920, each of which is herein incorporated by 
reference in its entirety. 

[00372] Oligonucleotides may also include nucleobase (often referred to in the 
art simply as "base") modifications or substitutions. As used herein, "unmodified" or 
"natural" nucleobases include the purine bases adenine (A) and guanine (G), and the 
pyrimidine bases thymine (T), cytosine (C) and uracil (U). Modified nucleobases 
include other synthetic and natural nucleobases such as 5-methylcytosine (5-me-C), 
5-hydroxymethyl cytosine, xanthine, hypoxanthine, 2-aminoadenine, 6-methyl and other 
alkyl derivatives of adenine and guanine, 2-propyl and other alkyl derivatives of 
adenine and guanine, 2-thiouracil, 2-thiothymine and 2-thiocytosine, 5-halouracil and 
cytosine, 5-propynyl uracil and cytosine, 6-azo uracil, cytosine and thymine, 5-uracil 
(pseudouracil), 4-thiouracil, 8-halo, 8-amino, 8-thiol, 8-thioalkyl, 8-hydroxyl and 
other 8-substituted adenines and guanines, 5-halo particularly 5-bromo, 
5-trifluoromethyl and other 5-substituted uracils and cytosines, 7-methylguanine and 
7-methyladenine, 8-azaguanine and 8-azaadenine, 7-deazaguanine and 7-deazaadenine 
and 3-deazaguanine and 3-deazaadenine. Further nucleobases include those disclosed 
in United States Patent No. 3,687,808, those disclosed in The Concise Encyclopedia 
Of Polymer Science And Engineering, pages 858-859, Kroschwitz, J. I., ed. John 
Wiley & Sons, 1990, those disclosed by Englisch et al., Angewandte Chemie, 
International Edition, 1991, 30, 613, and those disclosed by Sanghvi, Y.S., Chapter 
15, Antisense Research and Applications, pages 289-302, Crooke, S.T. and Lebleu, B. 
ed., CRC Press, 1993. Certain of these nucleobases are particularly useful for 
increasing the binding affinity of the oligomeric compounds of the invention. These 
include 5-substituted pyrimidines, 6-azapyrimidines and N-2, N-6 and O-6 substituted 
purines, including 2-aminopropyladenine, 5-propynyluracil and 5-propynylcytosine. 
5-methylcytosine substitutions have been shown to increase nucleic acid duplex 
stability by 0.6-1.2°C (Sanghvi, Y.S., Crooke, S.T. and Lebleu, B., eds, Antisense 
Research and Applications, CRC Press, Boca Raton, 1993, pp. 276-278) and are 
presently preferred base substitutions, even more particularly when combined with 2'- 
O-methoxyethyl sugar modifications. 

[00373] Representative United States patents that teach the preparation of 
certain of the above noted modified nucleobases as well as other modified 
nucleobases include, but are not limited to, the above noted U.S. 3,687,808, as well as 
U.S. 4,845,205; 5,130,302; 5,134,066; 5,175,273; 5,367,066; 5,432,272; 5,457,187; 
5,459,255; 5,484,908; 5,502,177; 5,525,711; 5,552,540; 5,587,469; 5,594,121; 
5,596,091; 5,614,617; 5,750,692, and 5,681,941, each of which is herein incorporated 
by reference. 

[00374] Another modification of the oligonucleotides of the invention involves 
chemically linking to the oligonucleotide one or more moieties or conjugates, which 
enhance the activity, cellular distribution, or cellular uptake of the oligonucleotide. 
Such moieties include but are not limited to lipid moieties such as a cholesterol 
moiety (Letsinger et al., Proc. Natl. Acad. Sci. USA, 1989, 86, 6553-6556), cholic acid 
(Manoharan et al., Bioorg. Med. Chem. Let., 1994, 4, 1053-1060), a thioether, e.g., 
hexyl-S-tritylthiol (Manoharan et al., Ann. N.Y. Acad. Sci., 1992, 660, 306-309; 
Manoharan et al., Bioorg. Med. Chem. Let., 1993, 3, 2765-2770), a thiocholesterol 
(Oberhauser et al., Nucl. Acids Res., 1992, 20, 533-538), an aliphatic chain, e.g., 
dodecandiol or undecyl residues (Saison-Behmoaras et al., EMBO J., 1991, 10, 
1111-1118; Kabanov et al., FEBS Lett., 1990, 259, 327-330; Svinarchuk et al., Biochimie, 
1993, 75, 49-54), a phospholipid, e.g., di-hexadecyl-rac-glycerol or triethylammonium 
1,2-di-O-hexadecyl-rac-glycero-3-H-phosphonate (Manoharan et al., Tetrahedron 
Lett., 1995, 36, 3651-3654; Shea et al., Nucl. Acids Res., 1990, 18, 3777-3783), a 
polyamine or a polyethylene glycol chain (Manoharan et al., Nucleosides & 
Nucleotides, 1995, 14, 969-973), or adamantane acetic acid (Manoharan et al., 
Tetrahedron Lett., 1995, 36, 3651-3654), a palmityl moiety (Mishra et al., Biochim. 
Biophys. Acta, 1995, 1264, 229-237), or an octadecylamine or hexylamino-carbonyl- 
oxycholesterol moiety (Crooke et al., J. Pharmacol. Exp. Ther., 1996, 277, 923-937). 

[00375] Representative United States patents that teach the preparation of such 
oligonucleotide conjugates include, but are not limited to, U.S. 4,828,979; 4,948,882; 
5,218,105; 5,525,465; 5,541,313; 5,545,730; 5,552,538; 5,578,717, 5,580,731; 
5,580,731; 5,591,584; 5,109,124; 5,118,802; 5,138,045; 5,414,077; 5,486,603; 
5,512,439; 5,578,718; 5,608,046; 4,587,044; 4,605,735; 4,667,025; 4,762,779; 
4,789,737; 4,824,941; 4,835,263; 4,876,335; 4,904,582; 4,958,013; 5,082,830; 
5,112,963; 5,214,136; 5,082,830; 5,112,963; 5,214,136; 5,245,022; 5,254,469; 
5,258,506; 5,262,536; 5,272,250; 5,292,873; 5,317,098; 5,371,241, 5,391,723; 
5,416,203, 5,451,463; 5,510,475; 5,512,667; 5,514,785; 5,565,552; 5,567,810; 
5,574,142; 5,585,481; 5,587,371; 5,595,726; 5,597,696; 5,599,923; 5,599,928 and 
5,688,941, each of which is herein incorporated by reference. 

[00376] It is not necessary for all positions in a given compound to be 
uniformly modified, and in fact more than one of the aforementioned modifications 
may be incorporated in a single compound or even at a single nucleoside within an 
oligonucleotide. The present invention also includes antisense compounds, which are 
chimeric compounds. "Chimeric" antisense compounds or "chimeras," in the context 
of this invention, are antisense compounds, particularly oligonucleotides, which 
contain two or more chemically distinct regions, each made up of at least one 
monomer unit, i.e., a nucleotide in the case of an oligonucleotide compound. These 
oligonucleotides typically contain at least one region wherein the oligonucleotide is 
modified so as to confer upon the oligonucleotide increased resistance to nuclease 
degradation, increased cellular uptake, and/or increased binding affinity for the target 
nucleic acid. An additional region of the oligonucleotide may serve as a substrate for 
enzymes capable of cleaving RNA:DNA or RNA:RNA hybrids. By way of example, 
RNase H is a cellular endonuclease that cleaves the RNA strand of an RNA:DNA 
duplex. Activation of RNase H, therefore, results in cleavage of the RNA target, 
thereby greatly enhancing the efficiency of oligonucleotide inhibition of gene 
expression. Consequently, comparable results can often be obtained with shorter 
oligonucleotides when chimeric oligonucleotides are used, compared to 
phosphorothioate deoxyoligonucleotides hybridizing to the same target region. 
Cleavage of the RNA target can be routinely detected by gel electrophoresis and, if 
necessary, associated nucleic acid hybridization techniques known in the art. 

[00377] Chimeric antisense compounds of the invention may be formed as 
composite structures of two or more oligonucleotides, modified oligonucleotides, 
oligonucleosides and/or oligonucleotide mimetics as described above. Such 
compounds have also been referred to in the art as hybrids or gapmers. Representative 
United States patents that teach the preparation of such hybrid structures include, but 
are not limited to, U.S. 5,013,830; 5,149,797; 5,220,007; 5,256,775; 5,366,878; 
5,403,711; 5,491,133; 5,565,350; 5,623,065; 5,652,355; 5,652,356; and 5,700,922, 
each of which is herein incorporated by reference in its entirety. 
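
One way to picture a chimeric compound having chemically distinct regions is as an ordered list of regions, each carrying a chemistry label. The sketch below is illustrative only: the names `Region` and `split_into_gapmer`, the 5-10-5 layout, and the use of "2'-MOE" wings around a "DNA" gap are assumptions chosen for the example, not a design prescribed by this specification.

```python
from dataclasses import dataclass
from typing import List


@dataclass(frozen=True)
class Region:
    chemistry: str  # label for the region's chemistry, e.g. "2'-MOE" or "DNA"
    bases: str      # bases of this region, written 5'->3'


def split_into_gapmer(oligo: str, wing: int = 5,
                      wing_chemistry: str = "2'-MOE",
                      gap_chemistry: str = "DNA") -> List[Region]:
    """Split an oligo into a 5' wing, a central gap, and a 3' wing."""
    if wing < 1 or len(oligo) <= 2 * wing:
        raise ValueError("oligo must be longer than the two wings combined")
    return [
        Region(wing_chemistry, oligo[:wing]),     # modified 5' wing
        Region(gap_chemistry, oligo[wing:-wing]),  # central gap (RNase H substrate region)
        Region(wing_chemistry, oligo[-wing:]),    # modified 3' wing
    ]


# Hypothetical 20-mer split into a 5-10-5 arrangement.
regions = split_into_gapmer("GGCCATGGCCATGGCCATGG")
```

In this arrangement the wings stand for the region modified for nuclease resistance or binding affinity, and the gap stands for the region able to support cleavage of the RNA strand.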

[00378] The antisense compounds used in accordance with this invention may 
be conveniently and routinely made through the well-known technique of solid phase 
synthesis. Equipment for such synthesis is sold by several vendors including, for 
example, Applied Biosystems (Foster City, CA). Any other means for such synthesis 
known in the art may additionally or alternatively be employed. It is well known to 
use similar techniques to prepare oligonucleotides such as the phosphorothioates and 
alkylated derivatives. 

[00379] The antisense compounds of the invention are synthesized in vitro and 
do not include antisense compositions of biological origin, or genetic vector 
constructs designed to direct the in vivo synthesis of antisense molecules. The 
compounds of the invention may also be admixed, encapsulated, conjugated or 
otherwise associated with other molecules, molecule structures or mixtures of 
compounds, as for example, liposomes, receptor targeted molecules, oral, rectal, 
topical or other formulations, for assisting in uptake, distribution and/or absorption. 
Representative United States patents that teach the preparation of such uptake, 
distribution and/or absorption assisting formulations include, but are not limited to, 
U.S. 5,108,921; 5,354,844; 5,416,016; 5,459,127; 5,521,291; 5,543,158; 5,547,932; 
5,583,020; 5,591,721; 4,426,330; 4,534,899; 5,013,556; 5,108,921; 5,213,804; 
5,227,170; 5,264,221; 5,356,633; 5,395,619; 5,416,016; 5,417,978; 5,462,854; 
5,469,854; 5,512,295; 5,527,528; 5,534,259; 5,543,152; 5,556,948; 5,580,575; and 
5,595,756, each of which is herein incorporated by reference. 

[00380] The antisense compounds of the present invention can be utilized for 
diagnostics, therapeutics, and prophylaxis and as research reagents and kits. For 
therapeutics, an animal, preferably a human, suspected of having a disease or 
disorder that can be treated by modulating the expression of SEQ ID NOs:39-49, 
SEQ ID NOs:51-76, and SEQ ID NO:3114 is treated by administering antisense 
compounds in accordance with this invention. The compounds of the invention can be 
utilized in pharmaceutical compositions by adding an effective amount of an antisense 
compound to a suitable pharmaceutically acceptable diluent or carrier. The 
antisense compounds and methods of the invention may also be useful 
prophylactically, e.g., to prevent or delay infection, inflammation, or tumor formation. 

[00381] The antisense compounds of the invention are useful for research and 
diagnostics, because these compounds hybridize to nucleic acids encoding SEQ ID 
NOs:39-49, SEQ ID NOs:51-76, and SEQ ID NO:3114, enabling sandwich and other 
assays to easily be constructed to exploit this fact. Hybridization of the antisense 
oligonucleotides of the invention with a nucleic acid encoding SEQ ID NOs:39-49, 
SEQ ID NOs:51-76, and SEQ ID NO:3114 can be detected by means known in the 
art. Such means may include conjugation of an enzyme to the oligonucleotide, 
radiolabelling of the oligonucleotide or any other suitable detection means. Kits using 
such detection means for detecting the level of SEQ ID NOs:39-49, SEQ ID NOs:51-76, 
and SEQ ID NO:3114 in a sample may also be prepared. 

[00382] Potential antagonists include small molecules that bind to the active 
site, the protein-binding site, or other relevant binding site (e.g., co-factor binding site, 
substrate binding site) of the polypeptide, thereby blocking the normal biological 
activity of the polypeptide. Examples of small molecules include, but are not limited 
to, small peptides or peptide-like molecules, preferably soluble peptides, and synthetic 
non-peptidyl organic or inorganic compounds. 

[00383] Ribozymes are enzymatic RNA molecules capable of catalyzing the 
specific cleavage of RNA. Ribozymes act by sequence-specific hybridization to the 
complementary target RNA, followed by endonucleolytic cleavage. Specific 
ribozyme cleavage sites within a potential RNA target can be identified by known 
techniques. For further details see, e.g., Rossi, Current Biology, 4:469-471 (1994), 
and PCT publication No. WO 97/33551 (published September 18, 1997). 

[00384] Nucleic acid molecules in triple-helix formation used to inhibit 
transcription should be single-stranded and composed of deoxynucleotides. The base 
composition of these oligonucleotides is designed such that it promotes triple-helix 
formation via Hoogsteen base-pairing rules, which generally require sizeable stretches 
of purines or pyrimidines on one strand of a duplex. For further details see, e.g., PCT 
publication No. WO 97/33551, supra. Such molecules can have backbone bonds not 
naturally found in DNA or RNA. 

[00385] These small molecules can be identified by any one or more of the 
screening assays discussed herein and/or by any other screening techniques well 
known for those skilled in the art. 

[00386] All assays for antagonists are common in that they call for contacting 
the drug candidate with a polypeptide encoded by a nucleic acid identified herein 
under conditions and for a time sufficient to allow these two components to interact. 

Assays for antagonists 

[00387] In another assay for antagonists, mammalian cells or a membrane 

preparation expressing the receptor would be incubated with labeled polypeptide in 
the presence of the candidate compound. The ability of the compound to enhance or 
block this interaction could then be measured. 

[00388] Potential antagonists include small molecules that bind to the active 
site, the receptor-binding site, or growth factor or other relevant binding site of the 
polypeptide, thereby blocking the normal biological activity of the polypeptide. 
Examples of small molecules include, but are not limited to, antibodies, small
peptides or peptide-like molecules, preferably soluble peptides, and synthetic
non-peptidyl organic or inorganic compounds.

Drug Screening 

[00389] This invention is particularly useful for screening compounds by using 

polypeptides or binding fragment thereof in any of a variety of drug screening 
techniques. The polypeptide or fragment employed in such a test can either be free in 
solution, affixed to a solid support, borne on a cell surface, or located intracellularly. 
One method of drug screening utilizes eukaryotic or prokaryotic host cells, which are 
stably transformed with recombinant nucleic acids expressing the polypeptide or 
fragment. Drugs are screened against such transformed cells in competitive binding 
assays. Such cells, either in viable or fixed form, can be used for standard binding 
assays. One can measure, for example, the formation of complexes between 
polypeptide or a fragment and the agent being tested. Alternatively, one can examine 
the diminution in complex formation between the polypeptide and its target cell or 
target receptors caused by the agent being tested. 

[00390] Thus, the present invention provides methods of screening for drugs or 

any other agents, which can affect a disease or disorder associated with a protein of 
the present invention. These methods comprise contacting such an agent with a
polypeptide or fragment thereof and assaying (i) for the presence of a complex 
between the agent and the polypeptide or fragment, or (ii) for the presence of a 
complex between the polypeptide or fragment and the cell, by methods well known in 
the art. In such competitive binding assays, the polypeptide or fragment is typically 
labeled. After suitable incubation, free polypeptide or fragment is separated from that 
present in bound form, and the amount of free or uncomplexed label is a measure of 
the ability of the particular agent to bind to polypeptide or to interfere with the 
polypeptide/cell complex. 
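
For illustration, the readout of such a competitive binding assay can be reduced to a percent inhibition of complex formation. The following Python sketch assumes that bound label is measured with and without the test agent and that a nonspecific background is subtracted; the function name, the background correction, and the example counts are assumptions and are not taken from the text.

```python
def percent_inhibition(bound_with_agent: float,
                       bound_control: float,
                       background: float = 0.0) -> float:
    """Percent reduction in specifically bound label caused by a test agent.

    bound_control is the signal without agent; background is an assumed
    nonspecific signal subtracted from both measurements.
    """
    specific_control = bound_control - background
    if specific_control <= 0:
        raise ValueError("control signal must exceed background")
    specific_with_agent = bound_with_agent - background
    return 100.0 * (1.0 - specific_with_agent / specific_control)


if __name__ == "__main__":
    # Hypothetical counts: control 1100, with agent 600, background 100.
    print(percent_inhibition(600, 1100, background=100))
```

An agent that does not interfere with the complex gives a value near zero, while a value near 100 indicates nearly complete displacement of the labeled polypeptide.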

[00391] Another technique for drug screening provides high throughput 

screening for compounds having suitable binding affinity to a polypeptide and is 
described in detail in WO 84/03564, published on September 13, 1984. Briefly stated, 
large numbers of different small peptide test compounds are synthesized on a solid 
substrate, such as plastic pins or some other surface. As applied to a protein of the 
present invention, the peptide test compounds are reacted with polypeptide and
washed. Bound polypeptide is detected by methods well known in the art. Purified
polypeptide can also be coated directly onto plates for use in the aforementioned drug 
screening techniques. In addition, non-neutralizing antibodies can be used to capture 
the peptide and immobilize it on the solid support. 

[00392] This invention also contemplates the use of competitive drug screening 

assays in which neutralizing antibodies capable of binding polypeptide specifically 
compete with a test compound for binding to polypeptide or fragments thereof. In 
this manner, the antibodies can be used to detect the presence of any peptide which 
shares one or more antigenic determinants with polypeptide. 

Disorders to be Treated 

Cancerous Disorders 

[00393] Polypeptides of the invention may be involved in cancer cell 

generation, proliferation or metastasis. Detection of the presence or amount of 
polynucleotides or polypeptides of the invention may be useful for the diagnosis 
and/or prognosis of one or more types of cancer. For example, the presence or 
increased expression of a polynucleotide/polypeptide of the invention may indicate a 
hereditary risk of cancer, a precancerous condition, or an ongoing malignancy. 
Conversely, a defect in the gene or absence of the polypeptide may be associated with 
a cancer condition. Identification of single nucleotide polymorphisms associated with 
cancer or a predisposition to cancer may also be useful for diagnosis or prognosis. 
[00394] Cancer treatments promote tumor regression by inhibiting tumor cell 

proliferation, inhibiting angiogenesis (growth of new blood vessels that is necessary 
to support tumor growth) and/or prohibiting metastasis by reducing tumor cell 
motility or invasiveness. Therapeutic compositions of the invention may be effective 
in adult and pediatric oncology including in solid phase tumors/malignancies, locally
advanced tumors, human soft tissue sarcomas, metastatic cancer, including lymphatic 
metastases, blood cell malignancies including multiple myeloma, acute and chronic 
leukemias, and lymphomas, head and neck cancers including mouth cancer, larynx 
cancer and thyroid cancer, lung cancers including small cell carcinoma and non-small 
cell cancers, breast cancers including small cell carcinoma and ductal carcinoma, 
gastrointestinal cancers including esophageal cancer, stomach cancer, colon cancer,
colorectal cancer and polyps associated with colorectal neoplasia, pancreatic cancers,
liver cancer, urologic cancers including bladder cancer and prostate cancer, 
malignancies of the female genital tract including ovarian carcinoma, uterine 
(including endometrial) cancers, and solid tumor in the ovarian follicle, kidney 
cancers including renal cell carcinoma, brain cancers including intrinsic brain tumors, 
neuroblastoma, astrocytic brain tumors, gliomas, metastatic tumor cell invasion in the 
central nervous system, bone cancers including osteomas, skin cancers including 
malignant melanoma, tumor progression of human skin keratinocytes, squamous cell 
carcinoma, basal cell carcinoma, hemangiopericytoma and Kaposi's sarcoma.

Leukemias 

[00395] Leukemias and related disorders may be treated or prevented by 

administration of a therapeutic that promotes or inhibits function of the 
polynucleotides and/or polypeptides of the invention. Such leukemias and related 
disorders include but are not limited to acute leukemia, acute lymphocytic leukemia, 
acute myelocytic leukemia, myeloblastic, promyelocytic, myelomonocytic,
monocytic, erythroleukemia, chronic leukemia, chronic myelocytic (granulocytic)
leukemia and chronic lymphocytic leukemia (for a review of such disorders, see 
Fishman et al., 1985, Medicine, 2d Ed., J.B. Lippincott Co., Philadelphia). 

Other activities 

[00396] A polypeptide of the invention may also exhibit one or more of the 

following additional activities or effects: inhibiting the growth, infection or function 
of, or killing, infectious agents, including, without limitation, bacteria, viruses, fungi 
and other parasites; affecting (suppressing or enhancing) bodily characteristics,
including, without limitation, height, weight, hair color, eye color, skin, fat to lean
ratio or other tissue pigmentation, or organ or body part size or shape (such as, for
example, breast augmentation or diminution, change in bone form or shape); affecting
biorhythms or circadian cycles or rhythms; affecting the fertility of male or female
subjects; affecting the metabolism, catabolism, anabolism, processing, utilization,
storage or elimination of dietary fat, lipid, protein, carbohydrate, vitamins, minerals,
co-factors or other nutritional factors or component(s); affecting behavioral
characteristics, including, without limitation, appetite, libido, stress, cognition
(including cognitive disorders), depression (including depressive disorders) and 
violent behaviors; providing analgesic effects or other pain reducing effects; 
promoting differentiation and growth of embryonic stem cells in lineages other than 
hematopoietic lineages; hormonal or endocrine activity; in the case of enzymes, 
correcting deficiencies of the enzyme and treating deficiency-related diseases;
treatment of hyperproliferative disorders (such as, for example, psoriasis); 
immunoglobulin-like activity (such as, for example, the ability to bind antigens or 
complement); and the ability to act as an antigen in a vaccine composition to raise an 
immune response against such protein or another material or entity which is
cross-reactive with such protein.

Applications, Administration Protocols, Schedules, Doses, and Formulations
[00397] The molecules herein and agonists and antagonists thereto are 

pharmaceutically useful as a prophylactic and therapeutic agent for various disorders 
and diseases as set forth above. 

[00398] Therapeutic compositions of the polypeptides or agonists or 

antagonists are prepared for storage by mixing the desired molecule having the 
appropriate degree of purity with optional pharmaceutically acceptable carriers, 
excipients, or stabilizers (Remington's Pharmaceutical Sciences, 16th edition, Osol, A.
ed. (1980)), in the form of lyophilized formulations or aqueous solutions. Acceptable
carriers, excipients, or stabilizers are nontoxic to recipients at the dosages and 
concentrations employed, and include buffers such as phosphate, citrate, and other 
organic acids; antioxidants including ascorbic acid and methionine; preservatives 
(such as octadecyldimethylbenzyl ammonium chloride; hexamethonium chloride; 
benzalkonium chloride, benzethonium chloride; phenol, butyl or benzyl alcohol; alkyl 
parabens such as methyl or propyl paraben; catechol; resorcinol; cyclohexanol;
3-pentanol; and m-cresol); low molecular weight (less than about 10 residues)
polypeptides; proteins, such as serum albumin, gelatin, or immunoglobulins; 
hydrophilic polymers such as polyvinylpyrrolidone; amino acids such as glycine, 
glutamine, asparagine, histidine, arginine, or lysine; monosaccharides, disaccharides, 
and other carbohydrates including glucose, mannose, or dextrins; chelating agents 
such as EDTA; sugars such as sucrose, mannitol, trehalose or sorbitol; salt-forming 
counter-ions such as sodium; metal complexes (e.g., Zn-protein complexes); and/or
non-ionic surfactants such as TWEEN™, PLURONICS™ or polyethylene glycol
(PEG). 

[00399] Additional examples of such carriers include ion exchangers, alumina, 

aluminum stearate, lecithin, serum proteins, such as human serum albumin, buffer 
substances such as phosphates, glycine, sorbic acid, potassium sorbate, partial 
glyceride mixtures of saturated vegetable fatty acids, water, salts, or electrolytes such 
as protamine sulfate, disodium hydrogen phosphate, potassium hydrogen phosphate, 
sodium chloride, zinc salts, colloidal silica, magnesium trisilicate, polyvinyl 
pyrrolidone, cellulose-based substances, and polyethylene glycol. Carriers for topical 
or gel-based forms of antagonist include polysaccharides such as sodium 
carboxymethyl cellulose or methylcellulose, polyvinylpyrrolidone, polyacrylates,
polyoxyethylene-polyoxypropylene-block polymers, polyethylene glycol, and wood 
wax alcohols. For all administrations, conventional depot forms are suitably used. 
Such forms include, for example, microcapsules, nano-capsules, liposomes, plasters, 
inhalation forms, nose sprays, sublingual tablets, and sustained-release preparations. 
The polypeptides, agonists, or antagonists will typically be formulated in such 
vehicles at a concentration of about 0.1 mg/ml to 100 mg/ml.
[00400] Another formulation comprises incorporating a polypeptide or 

antagonist thereof into formed articles. Such articles can be used in modulating cancer 
cell growth. In addition, tumor invasion and metastasis may be modulated with these 
articles. 

[00401] Polypeptide or antagonist to be used for in vivo administration must be 

sterile. This is readily accomplished by filtration through sterile filtration membranes, 
prior to or following lyophilization and reconstitution. Polypeptide ordinarily will be 
stored in lyophilized form or in solution if administered systemically. If in lyophilized 
form, polypeptide or antagonist thereto is typically formulated in combination with 
other ingredients for reconstitution with an appropriate diluent at the time for use. An 
example of a liquid formulation of a polypeptide or antagonist is a sterile, clear, 
colorless unpreserved solution filled in a single-dose vial for subcutaneous injection. 
Preserved pharmaceutical compositions suitable for repeated use may contain, for
example, depending mainly on the indication and type of polypeptide: a) a
polypeptide or agonist or antagonist thereto; b) a buffer capable of maintaining the pH
in a range of maximum stability of the polypeptide or other molecule in solution,
preferably about 4-8; c) a detergent/surfactant primarily to stabilize the polypeptide or
molecule against agitation-induced aggregation; d) an isotonifier; e) a preservative 
selected from the group of phenol, benzyl alcohol and a benzethonium halide, e.g., 
chloride; and water. 

[00402] If the detergent employed is non-ionic, it may, for example, be 

polysorbates (e.g., POLYSORBATE™ (TWEEN™) 20, 80, etc.) or poloxamers (e.g., 
POLOXAMER™ 188). The use of non-ionic surfactants permits the formulation to be 
exposed to shear surface stresses without causing denaturation of the polypeptide. 
Further, such surfactant-containing formulations may be employed in aerosol devices 
such as those used in a pulmonary dosing, and needleless jet injector guns (see, e.g., 
EP 257,956). 

[00403] An isotonifier may be present to ensure isotonicity of a liquid 

composition of the polypeptide or antagonist thereto, and includes polyhydric sugar 
alcohols, preferably trihydric or higher sugar alcohols, such as glycerin, erythritol, 
arabitol, xylitol, sorbitol, and mannitol. These sugar alcohols can be used alone or in 
combination. Alternatively, sodium chloride or other appropriate inorganic salts may 
be used to render the solutions isotonic. 

[00404] The buffer may, for example, be an acetate, citrate, succinate, or 
phosphate buffer depending on the pH desired. The pH of one type of liquid 
formulation of this invention is buffered in the range of about 4 to 8, preferably about 
physiological pH. 

[00405] The preservatives phenol, benzyl alcohol and benzethonium halides, 

e.g., chloride, are known antimicrobial agents that may be employed. 
[00406] Therapeutic polypeptide compositions generally are placed into a 

container having a sterile access port, for example, an intravenous solution bag, or 
vial having a stopper pierceable by a hypodermic injection needle. The formulations 
are preferably administered as repeated intravenous (i.v.), subcutaneous (s.c.), or
intramuscular (i.m.) injections, or as aerosol formulations suitable for intranasal or 
intrapulmonary delivery (for intrapulmonary delivery see, e.g., EP 257,956). 
[00407] The polypeptide can also be administered in the form of sustained-release
preparations. Suitable examples of sustained-release preparations include
semipermeable matrices of solid hydrophobic polymers containing the protein, which 
matrices are in the form of shaped articles, e.g., films, or microcapsules. Examples of
sustained-release matrices include polyesters, hydrogels (e.g., poly(2-hydroxyethyl-
methacrylate) as described by Langer et al., J. Biomed. Mater. Res., 15: 167-277
(1981) and Langer, Chem. Tech., 12: 98-105 (1982) or poly(vinylalcohol)),
polylactides (U.S. Patent No. 3,773,919, EP 58,481), copolymers of L-glutamic acid
and gamma ethyl-L-glutamate (Sidman et al., Biopolymers, 22: 547-556 (1983)), non-
degradable ethylene-vinyl acetate (Langer et al., supra), degradable lactic acid-
glycolic acid copolymers such as the Lupron Depot™ (injectable microspheres
composed of lactic acid-glycolic acid copolymer and leuprolide acetate), and poly-D-
(-)-3-hydroxybutyric acid (EP 133 988).

[00408] While polymers such as ethylene-vinyl acetate and lactic acid-glycolic 

acid enable release of molecules for over 100 days, certain hydrogels release proteins 
for shorter time periods. When encapsulated proteins remain in the body for a long 
time, they may denature or aggregate as a result of exposure to moisture at 37°C, 
resulting in a loss of biological activity and possible changes in immunogenicity. 
Rational strategies can be devised for protein stabilization depending on the 
mechanism involved. For example, if the aggregation mechanism is discovered to be 
intermolecular S-S bond formation through thio-disulfide interchange, stabilization 
may be achieved by modifying sulfhydryl residues, lyophilizing from acidic solutions, 
controlling moisture content, using appropriate additives, and developing specific 
polymer matrix compositions. 

[00409] Sustained-release polypeptide compositions also include liposomally 

entrapped polypeptides. Liposomes containing the polypeptide are prepared by 
methods known per se: DE 3,218,121; Epstein et al., Proc. Natl. Acad. Sci. USA, 82:
3688-3692 (1985); Hwang et al., Proc. Natl. Acad. Sci. USA, 77: 4030-4034 (1980);
EP 52 322; EP 36 676; EP 88 046; EP 143 949; EP 142,641; Japanese patent
application 83-118008; U.S. Patent Nos. 4,485,045 and 4,544,545; and EP 102 324.
Ordinarily the liposomes are of the small (about 200-800 Angstroms) unilamellar type 
in which the lipid content is greater than about 30 mol. % cholesterol, the selected 
proportion being adjusted for the optimal therapy. 

[00410] The therapeutically effective dose of polypeptide or antagonist thereto 
will, of course, vary depending on such factors as the pathological condition to be 
treated (including prevention), the method of administration, the type of compound 
being used for treatment, any co-therapy involved, the patient's age, weight, general
medical condition, medical history, etc., and its determination is well within the skill
of a practicing physician. Accordingly, it will be necessary for the therapist to titer the 
dosage and modify the route of administration as required to obtain the maximal 
therapeutic effect. If the polypeptide has a narrow host range, for the treatment of 
human patients formulations comprising human polypeptide, more preferably native- 
sequence human polypeptide, are preferred. The clinician will administer polypeptide 
until a dosage is reached that achieves the desired effect for treatment of the condition 
in question. For example, if the objective is the treatment of CHF, the amount would 
be one that inhibits the progressive cardiac hypertrophy associated with this condition. 
The progress of this therapy is easily monitored by echocardiography. Similarly, in 
patients with hypertrophic cardiomyopathy, polypeptide can be administered on an 
empirical basis. 

[00411] With the above guidelines, the effective dose generally is within the 

range of from about 0.001 to about 1.0 mg/kg, more preferably about 0.01-1.0 mg/kg, 
most preferably about 0.01-0.1 mg/kg.
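
Purely as an arithmetic illustration of the per-kilogram range given above, the following Python sketch converts a dose in mg/kg to a total dose and then to an injection volume at a chosen formulation concentration. The function names, the 70 kg body weight, and the 10 mg/ml concentration are hypothetical values selected for the example.

```python
# Range stated in the text for the effective dose, in mg per kg body weight.
DOSE_RANGE_MG_PER_KG = (0.001, 1.0)


def total_dose_mg(dose_mg_per_kg: float, body_weight_kg: float) -> float:
    """Total dose in mg for a given per-kg dose and body weight."""
    low, high = DOSE_RANGE_MG_PER_KG
    if not low <= dose_mg_per_kg <= high:
        raise ValueError("dose is outside the stated 0.001-1.0 mg/kg range")
    if body_weight_kg <= 0:
        raise ValueError("body weight must be positive")
    return dose_mg_per_kg * body_weight_kg


def injection_volume_ml(dose_mg: float, concentration_mg_per_ml: float) -> float:
    """Volume in ml that delivers dose_mg at the given concentration."""
    if concentration_mg_per_ml <= 0:
        raise ValueError("concentration must be positive")
    return dose_mg / concentration_mg_per_ml


if __name__ == "__main__":
    dose = total_dose_mg(0.1, 70.0)            # hypothetical 70 kg patient
    volume = injection_volume_ml(dose, 10.0)   # hypothetical 10 mg/ml vial
    print(f"{dose:.2f} mg in {volume:.2f} ml")
```

The actual dose for any patient remains a matter for the practicing physician, as stated above.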

[00412] For non-oral use in treating human adult hypertension, it is 

advantageous to administer polypeptide in the form of an injection at about 0.01 to 50 
mg, preferably about 0.05 to 20 mg, most preferably 1 to 20 mg, per kg body weight, 
1 to 3 times daily by intravenous injection. For oral administration, a molecule based
on the polypeptide is preferably administered at about 5 mg to 1 g, preferably about 
10 to 100 mg, per kg body weight, 1 to 3 times daily. It should be appreciated that 
endotoxin contamination should be kept minimally at a safe level, for example, less 
than 0.5 ng/mg protein. Moreover, for human administration, the formulations 
preferably meet sterility, pyrogenicity, general safety, and purity as required by FDA 
Office and Biologics standards.

[00413] The dosage regimen of a pharmaceutical composition containing 

polypeptide to be used in tissue regeneration will be determined by the attending 
physician considering various factors that modify the action of the polypeptides, e.g., 
amount of tissue weight desired to be formed, the site of damage, the condition of the 
damaged tissue, the size of a wound, type of damaged tissue (e.g., bone), the patient's 
age, sex, and diet, the severity of any infection, time of administration, and other 
clinical factors. The dosage may vary with the type of matrix used in the 
reconstitution and with inclusion of other proteins in the pharmaceutical composition.
For example, the addition of other known growth factors, such as IGF-I, to the final
composition may also affect the dosage. Progress can be monitored by periodic
assessment of tissue/bone growth and/or repair, for example, X-rays,
histomorphometric determinations, and tetracycline labeling. 

[00414] The route of polypeptide or antagonist or agonist administration is in 

accord with known methods, e.g., by injection or infusion by intravenous, 
intramuscular, intracerebral, intraperitoneal, intracerebral spinal, subcutaneous, 
intraocular, intraarticular, intrasynovial, intrathecal, oral, topical, or inhalation routes, 
or by sustained-release systems as noted below. The polypeptide or antagonists
thereof also are suitably administered by intratumoral, peritumoral, intralesional, or
perilesional routes, to exert local as well as systemic therapeutic effects. The
intraperitoneal route is expected to be particularly useful, for example, in the 
treatment of ovarian tumors. 

[00415] If a peptide or small molecule is employed as an antagonist or agonist, 

it is preferably administered orally or non-orally in the form of a liquid or solid to 
mammals. 

[00416] Examples of pharmacologically acceptable salts of molecules that form 

salts and are useful hereunder include alkali metal salts (e.g., sodium salt, potassium 
salt), alkaline earth metal salts (e.g., calcium salt, magnesium salt), ammonium salts, 
organic base salts (e.g., pyridine salt, triethylamine salt), inorganic acid salts (e.g., 
hydrochloride, sulfate, nitrate), and salts of organic acid (e.g., acetate, oxalate, p- 
toluenesulfonate). 

Therapeutic compositions and combination therapy 

[00417] Polypeptides, polynucleotides, or modulators of polypeptides of the 

invention (including inhibitors and stimulators of the biological activity of the 
polypeptide of the invention) may be administered to treat cancer. Therapeutic 
compositions can be administered in therapeutically effective dosages alone or in 
combination with adjuvant cancer therapy such as surgery, chemotherapy, 
radiotherapy, thermotherapy, and laser therapy, and may provide a beneficial effect, 
e.g. reducing tumor size, slowing rate of tumor growth, inhibiting metastasis, or 
otherwise improving overall clinical condition, without necessarily eradicating the 
cancer.

[00418] The composition can also be administered in therapeutically effective 

amounts as a portion of an anti-cancer cocktail. An anti-cancer cocktail is a mixture of 
the polypeptide or modulator of the invention with one or more anti-cancer drugs in 
addition to a pharmaceutically acceptable carrier for delivery. The use of anti-cancer 
cocktails as a cancer treatment is routine. Anti-cancer drugs that are well known in the 
art and can be used as a treatment in combination with the polypeptide or modulator 
of the invention include: Actinomycin D, Aminoglutethimide, Asparaginase, 
Bleomycin, Busulfan, Carboplatin, Carmustine, Chlorambucil, Cisplatin (cis-DDP),
Cyclophosphamide, Cytarabine HCl (Cytosine arabinoside), Dacarbazine,
Dactinomycin, Daunorubicin HCl, Doxorubicin HCl, Estramustine phosphate
sodium, Etoposide (VP-16-213), Floxuridine, 5-Fluorouracil (5-FU), Flutamide,
Hydroxyurea (hydroxycarbamide), Ifosfamide, Interferon Alpha-2a, Interferon Alpha-
2b, Leuprolide acetate (LHRH-releasing factor analog), Lomustine, Mechlorethamine
HCl (nitrogen mustard), Melphalan, Mercaptopurine, Mesna, Methotrexate (MTX),
Mitomycin, Mitoxantrone HCl, Octreotide, Plicamycin, Procarbazine HCl,
Streptozocin, Tamoxifen citrate, Thioguanine, Thiotepa, Vinblastine sulfate,
Vincristine sulfate, Amsacrine, Azacitidine, Hexamethylmelamine, Interleukin-2, 
Mitoguazone, Pentostatin, Semustine, Teniposide, and Vindesine sulfate. 
[00419] In addition, therapeutic compositions of the invention may be used for 

prophylactic treatment of cancer. There are hereditary conditions and/or 
environmental situations (e.g. exposure to carcinogens) known in the art that 
predispose an individual to developing cancers. Under these circumstances, it may be 
beneficial to treat these individuals with therapeutically effective doses of the 
polypeptide of the invention to reduce the risk of developing cancers. 

Pharmaceutical Compositions of Antibodies 

[00420] Antibodies specifically binding a polypeptide identified herein, as well 

as other molecules identified by the screening assays disclosed herein, can be 
administered for the treatment of various disorders in the form of pharmaceutical 
compositions. 

[00421] If the polypeptide is intracellular and whole antibodies are used as 

inhibitors, internalizing antibodies are preferred. However, lipofections or liposomes 
can also be used to deliver the antibody, or an antibody fragment, into cells. Where
antibody fragments are used, the smallest inhibitory fragment that specifically binds
to the binding domain of the target protein is preferred. For example, based upon the
variable-region sequences of an antibody, peptide molecules can be designed that
retain the ability to bind the target protein sequence. Such peptides can be synthesized
chemically and/or produced by recombinant DNA technology. See, e.g., Marasco et
al., Proc. Natl. Acad. Sci. USA, 90: 7889-7893 (1993). The formulation herein can
also contain more than one active compound as necessary for the particular indication 
being treated, preferably those with complementary activities that do not adversely 
affect each other. Alternatively, or in addition, the composition can comprise an agent 
that enhances its function, such as, for example, a cytotoxic agent, cytokine, 
chemotherapeutic agent, or growth-inhibitory agent. Such molecules are suitably 
present in combination in amounts that are effective for the purpose intended. 

In vitro models for effective doses 

[00422] In vitro models can be used to determine the effective doses of the 

polypeptide of the invention as a potential cancer treatment. These in vitro models 
include proliferation assays of cultured tumor cells, growth of cultured tumor cells in 
soft agar (see Freshney, (1987) Culture of Animal Cells: A Manual of Basic 
Technique, Wiley-Liss, New York, NY Ch 18 and Ch 21), tumor systems in nude mice
as described in Giovanella et al., J. Natl. Can. Inst., 52: 921-30 (1974), mobility and
invasive potential of tumor cells in Boyden Chamber assays as described in Pilkington
et al., Anticancer Res., 17: 4107-9 (1997), and angiogenesis assays such as induction
of vascularization of the chick chorioallantoic membrane or induction of vascular
endothelial cell migration as described in Ribatta et al., Intl. J. Dev. Biol., 40: 1189-97
(1999) and Li et al., Clin. Exp. Metastasis, 17:423-9 (1999), respectively. Suitable
tumor cell lines are available, e.g. from American Type Tissue Culture Collection
catalogs. 

Articles of Manufacture 

[00423] An article of manufacture such as a kit containing polypeptide or 

antagonists thereof useful for the diagnosis or treatment of the disorders described 
above comprises at least a container and a label. Suitable containers include, for 
example, bottles, vials, syringes, and test tubes. The containers may be formed from a
variety of materials such as glass or plastic. The container holds a composition that is 
effective for diagnosing or treating the condition and may have a sterile access port 
(for example, the container may be an intravenous solution bag or a vial having a 
stopper pierceable by a hypodermic injection needle). The active agent in the 
composition is the polypeptide or an agonist or antagonist thereto. The label on, or 
associated with, the container indicates that the composition is used for diagnosing or 
treating the condition of choice. The article of manufacture may further comprise a 
second container comprising a pharmaceutically acceptable buffer, such as phosphate- 
buffered saline, Ringer's solution, and dextrose solution. It may further include other
materials desirable from a commercial and user standpoint, including other buffers, 
diluents, filters, needles, syringes, and package inserts with instructions for use. The 
article of manufacture may also comprise a second or third container with another 
active agent as described above. 

Diagnosis, Prognosis, and Management of Cancer 

[00424] The polynucleotides described herein, as well as their gene products 

and corresponding genes and gene products, are of particular interest as genetic or 
biochemical markers (e.g., in blood or tissues) that will detect the earliest changes 
along the carcinogenesis pathway and/or to monitor the efficacy of various therapies 
and preventive interventions. 

[00425] For example, the level of expression of certain polynucleotides can be 

indicative of a poorer prognosis, and therefore warrant more aggressive chemo- or 
radio-therapy for a patient or vice versa. 

[00426] The correlation of novel surrogate tumor specific features with 

response to treatment and outcome in patients can define prognostic indicators that 
allow the design of tailored therapy based on the molecular profile of the tumor. 
These therapies include antibody targeting, antagonists (e.g., small molecules), and 
gene therapy. 

[00427] Determining expression of certain polynucleotides and comparison of a 

patient's profile with known expression in normal tissue and variants of the disease
allows a determination of the best possible treatment for a patient, both in terms of 
specificity of treatment and in terms of comfort level of the patient. Surrogate tumor 
markers, such as polynucleotide expression, can also be used to better classify, and
thus diagnose and treat, different forms and disease states of cancer. Two
classifications widely used in oncology that can benefit from identification of the 
expression levels of the genes corresponding to the polynucleotides described herein 
are staging of the cancerous disorder, and grading the nature of the cancerous tissue. 
[00428] The polynucleotides that correspond to differentially expressed genes, 
as well as their encoded gene products, can be useful to monitor patients having or 
susceptible to cancer to detect potentially malignant events at a molecular level before 
they are detectable at a gross morphological level. In addition, the polynucleotides 
described herein, as well as the genes corresponding to such polynucleotides, can be 
useful as therametrics, e.g., to assess the effectiveness of therapy by using the 
polynucleotides or their encoded gene products, to assess, for example, tumor burden 
in the patient before, during, and after therapy. 

[00429] Furthermore, a polynucleotide identified as corresponding to a gene
that is differentially expressed in, and thus is important for, one type of cancer can
also have implications for development or risk of development of other types of 
cancer, e.g., where a polynucleotide represents a gene differentially expressed across 
various cancer types. Thus, for example, expression of a polynucleotide 
corresponding to a gene that has clinical implications for metastatic colon cancer can 
also have clinical implications for breast cancer or ovarian cancer. 

Staging. 

[00430] Staging is a process used by physicians to describe how advanced the
cancerous state is in a patient. Staging assists the physician in determining a
prognosis, planning treatment and evaluating the results of such treatment. Staging 
systems vary with the types of cancer, but generally involve the following "TNM" 
system: the type of tumor, indicated by T and a number which describes the tumor's 
size; whether the cancer has metastasized to nearby lymph nodes, indicated by N; and 
whether the cancer has metastasized to more distant parts of the body, indicated by M. 
Generally, if a cancer is only detectable in the area of the primary lesion without 
having spread to any lymph nodes it is called Stage I. If it has spread only to the 
closest lymph nodes, it is called Stage II. In Stage III, the cancer has generally spread
to the lymph nodes in near proximity to the site of the primary lesion. Cancers that
have spread to a distant part of the body, such as the liver, bone, brain or other site,
are Stage IV, the most advanced stage. 

[00431] The polynucleotides and corresponding genes and gene products 
described herein can facilitate fine-tuning of the staging process by identifying 
markers for the aggressiveness of a cancer, e.g. the metastatic potential, as well as the 
presence in different areas of the body. Thus, detection in a borderline Stage II cancer
of a polynucleotide signifying high metastatic potential can be used to reclassify the
tumor as Stage III, justifying more aggressive therapy. Conversely, the
presence of a polynucleotide signifying a lower metastatic potential allows more 
conservative staging of a tumor. 

Grading of cancers. 

[00432] Grade is a term used to describe how closely a tumor resembles normal
tissue of its same type. The microscopic appearance of a tumor is used to identify
tumor grade based on parameters such as cell morphology, cellular organization, and 
other markers of differentiation. As a general rule, the grade of a tumor corresponds to 
its rate of growth or aggressiveness, with undifferentiated or high-grade tumors 
generally being more aggressive than well differentiated or low grade tumors. The
following guidelines are generally used for grading tumors: 1) GX: grade cannot be
assessed; 2) G1: well differentiated; 3) G2: moderately well differentiated; 4) G3: poorly
differentiated; 5) G4: undifferentiated. The polynucleotides of SEQ ID NOs:77-3011
and their corresponding genes and gene products can be especially valuable in
determining the grade of the tumor, as they not only can aid in determining the 
differentiation status of the cells of a tumor, they can also identify factors other than 
differentiation that are valuable in determining the aggressiveness of a tumor, such as 
metastatic potential. 

Detection of colon cancer. 

[00433] The polynucleotides corresponding to genes that exhibit the 
appropriate expression pattern can be used to detect colon cancer in a subject. 
Colorectal cancer is one of the most common neoplasms in humans and perhaps the 
most frequent form of hereditary neoplasia.

[00434] Prevention and early detection are key factors in controlling and curing
colorectal cancer. Colorectal cancer begins as polyps, which are small, benign
growths of cells that form on the inner lining of the colon. Over a period of several 
years, some of these polyps accumulate additional mutations and become cancerous. 
Multiple familial colorectal cancer disorders have been identified, which are 
summarized as follows: 1) Familial adenomatous polyposis (FAP); 2) Gardner's 
syndrome; 3) Hereditary nonpolyposis colon cancer (HNPCC); and 4) Familial 
colorectal cancer in Ashkenazi Jews. 

[00435] The expression of appropriate polynucleotides can be used in the
diagnosis, prognosis and management of colorectal cancer. Detection of colon cancer
can be determined using expression levels of any of these sequences alone or in 
combination with the levels of expression. Determination of the aggressive nature 
and/or the metastatic potential of a colon cancer can be determined by comparing 
levels of one or more gene products of the genes corresponding to the polynucleotides 
described herein, and comparing total levels of another sequence known to vary in 
cancerous tissue, e.g., expression of p53, DCC, ras, FAP (see, e.g., Fearon ER, et al.,
Cell (1990) 61(5):759; Hamilton SR, et al., Cancer (1993) 72:957; Bodmer W, et al.,
Nat Genet. (1994) 4(3):217; Fearon ER, Ann NY Acad Sci. (1995) 768:101).

[00436] For example, development of colon cancer can be detected by
comparing the level of expression of a gene corresponding to a polynucleotide
described herein to the levels of oncogenes (e.g., ras) or tumor suppressor genes (e.g.,
FAP or p53). Thus expression of specific marker polynucleotides can be used to
discriminate between normal and cancerous colon tissue, to discriminate between 
colon cancers with different cells of origin, to discriminate between colon cancers 
with different potential metastatic rates, etc. For a review of markers of cancer, see, 
e.g., Hanahan et al. (2000) Cell 100:57-70. 

Monitoring efficacy in clinical trials

[00437] The invention also features a method of monitoring the efficacy of a 
compound in clinical trials for inhibition of tumors, e.g., colon tumors, in a patient by 
obtaining a first sample of tumor tissue cells from the patient; administering the 
compound to the patient; after a time sufficient for the compound to inhibit the tumor, 
obtaining a second sample of tumor tissue cells from the patient; and detecting in the
first and second samples the level of any one or more of SEQ ID NOs:77-1993, SEQ ID
NOs:3012-3083, or the genes of SEQ ID NOs:1-11, SEQ ID NOs:13-38, and SEQ ID
NO:3113, wherein a level lower in the second sample than in the first sample indicates
that the compound is effective to inhibit a tumor in the patient. Alternatively, the 
efficacy of a compound in clinical trials for inhibition of tumors, e.g., colon tumors, in 
a patient can be monitored by obtaining a first sample of tumor tissue cells from the 
patient; administering the compound to the patient; after a time sufficient for the 
compound to inhibit the tumor, obtaining a second sample of tumor tissue cells from 
the patient; and detecting in the first and second samples the level of any one or more 
of nucleic acid sequences of SEQ ID NOs:1994-3011, wherein a level higher in the
second sample than in the first sample indicates that the compound is effective to 
inhibit a tumor in the patient. 
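
As a rough illustration of the two comparisons described in this paragraph, the following Python sketch treats a compound as apparently effective when markers expected to fall are lower in the second sample and markers expected to rise are higher. The function name, the marker labels, the data layout, and the use of a plain inequality with no statistical test are all assumptions made for illustration and are not part of the disclosure.

```python
def compound_appears_effective(first, second, expect_lower, expect_higher):
    """Compare marker levels in two samples taken from the same patient.

    first, second: dicts mapping a marker label to its measured level in the
        first (before the compound) and second (after the compound) sample.
    expect_lower: markers whose level should drop if the compound works.
    expect_higher: markers whose level should rise if the compound works.
    Returns True only if every listed marker moved in the expected direction.
    """
    lower_ok = all(second[m] < first[m] for m in expect_lower)
    higher_ok = all(second[m] > first[m] for m in expect_higher)
    return lower_ok and higher_ok


# Hypothetical marker labels and arbitrary expression units.
first_sample = {"marker_up_in_tumor": 8.0, "marker_down_in_tumor": 1.5}
second_sample = {"marker_up_in_tumor": 3.0, "marker_down_in_tumor": 4.0}

result = compound_appears_effective(
    first_sample,
    second_sample,
    expect_lower=["marker_up_in_tumor"],
    expect_higher=["marker_down_in_tumor"],
)
print(result)
```

In practice a threshold or statistical criterion would likely replace the strict inequality; the sketch only captures the direction of change that the paragraph describes.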

[00438] Monitoring the influence of agents (e.g., drugs, compounds) on
inhibition of tumor growth (e.g., the ability to modulate aberrant cell proliferation
and/or differentiation) can be applied not only in basic drug screening, but also in 
clinical trials. For example, the effectiveness of an agent, as determined by a 
screening assay as described herein, to increase gene expression, protein levels or 
protein activity, can be monitored in clinical trials of subjects exhibiting decreased 
gene expression, protein levels, or protein activity. Alternatively, the effectiveness of 
an agent, as determined by a screening assay, to decrease gene expression, protein 
levels, or protein activity, can be monitored in clinical trials of subjects exhibiting 
increased gene expression, protein levels, or protein activity. In such clinical trials, 
expression or activity of the polypeptide and, preferably, that of other polypeptides
that have been implicated in colon cancer, can be used as markers.

[00439] For example, and not by way of limitation, genes, including those of
the invention, that are modulated in cells by treatment with an agent (e.g., compound,
drug or small molecule), which modulates tumor growth (e.g., as identified in a
screening assay described herein) can be identified. Thus, to study the effect of agents
on cancer, e.g., colon cancer, for example, in a clinical trial, cells can be isolated and 
RNA prepared and analyzed for the levels of expression of a gene of the invention 
and other genes implicated in the disorder. The levels of gene expression (i.e., a gene 
expression pattern) can be quantified by Northern blot analysis or RT-PCR, as 
described herein, or alternatively by measuring the amount of protein produced, by
one of the methods as described herein, or by measuring the levels of activity of a
gene of the invention or other genes. In this way, the gene expression pattern can 
serve as a marker, indicative of the physiological response of the cells to the agent. 
Accordingly, this response state may be determined before, and at various points 
during, treatment of the individual with the agent. 

[00440] In a preferred embodiment, the present invention provides a method
for monitoring the effectiveness of treatment of a subject with an agent (e.g., an
agonist, antagonist, peptidomimetic, protein, peptide, nucleic acid, small molecule, 
antibody or other drug candidate identified by the screening assays described herein) 
comprising the steps of (i) obtaining a pre-administration sample from a subject prior 
to administration of the agent; (ii) detecting the level of the polypeptide or nucleic 
acid of the invention in the pre-administration sample; (iii) obtaining one or more 
post-administration samples from the subject; (iv) detecting the level of the
polypeptide or nucleic acid of the invention in the post-administration sample; (v) 
comparing the level of the polypeptide or nucleic acid of the invention in the pre- 
administration sample with the level of the polypeptide or nucleic acid of the 
invention in the post-administration sample or samples; and (vi) altering the 
administration of the agent to the subject accordingly. For example, increased 
administration of the agent may be desirable to reduce expression or activity of the 
polypeptide, i.e., to increase the effectiveness of the agent. 
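
Steps (i) through (vi) can be sketched in Python as follows, for the case of an agent intended to reduce the level of the polypeptide or nucleic acid. The function names, the `target_ratio` threshold, and the returned labels are hypothetical and chosen only to make the comparison in step (v) and the adjustment in step (vi) concrete.

```python
def latest_ratio(pre_level, post_levels):
    """Ratio of the most recent post-administration level to the pre level."""
    if pre_level <= 0:
        raise ValueError("pre-administration level must be positive")
    if not post_levels:
        raise ValueError("at least one post-administration level is required")
    return post_levels[-1] / pre_level


def suggest_adjustment(pre_level, post_levels, target_ratio=0.5):
    """Toy version of step (vi) for an agent meant to REDUCE the marker.

    target_ratio is a hypothetical threshold: if the latest post/pre ratio
    is at or below it, the current regimen is treated as sufficient.
    """
    ratio = latest_ratio(pre_level, post_levels)
    return "maintain" if ratio <= target_ratio else "consider increase"


# Arbitrary units; one pre-administration level, two later samples each.
decision_a = suggest_adjustment(10.0, [9.0, 8.0])
decision_b = suggest_adjustment(10.0, [7.0, 4.0])
print(decision_a, decision_b)
```

The rule is deliberately simplistic; any real adjustment of administration would rest on clinical judgment rather than a single ratio.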

Definitions 

[00441] A "native sequence" comprises a polypeptide having the same amino
acid sequence as a protein of the present invention derived from nature. Such native
sequences can be isolated from nature or can be produced by recombinant and/or 
synthetic means. The term "native sequence" specifically encompasses naturally 
occurring truncated or secreted forms (e.g., an extracellular domain sequence),
naturally occurring variant forms (e.g., alternatively spliced forms), and naturally
occurring allelic variants of the protein. In one embodiment of the invention, the 
native sequence of a protein of the present invention is a mature or full-length native 
sequence comprising amino acids encompassing the N-terminus to the C-terminus of 
the known sequence.

[00442] A "variant polypeptide" means an active polypeptide as defined herein
having at least about 80% amino acid sequence identity with the amino acid sequence
of the protein in Table 1. Such identity can be to the residues of the full-length 
polypeptide or to a specifically derived fragment of the amino acid sequence of the 
protein. Such variant polypeptides include, for instance, polypeptides wherein one or 
more amino acid residues are added, or deleted, at the N- and/or C-terminus, as well 
as within one or more internal domains of each amino acid sequence. Ordinarily, a 
variant polypeptide will have at least about 80% amino acid sequence identity, more 
preferably at least about 81% amino acid sequence identity, more preferably at least
about 82% amino acid sequence identity, more preferably at least about 83% amino 
acid sequence identity, more preferably at least about 84% amino acid sequence 
identity, more preferably at least about 85% amino acid sequence identity, more 
preferably at least about 86% amino acid sequence identity, more preferably at least 
about 87% amino acid sequence identity, more preferably at least about 88% amino 
acid sequence identity, more preferably at least about 89% amino acid sequence 
identity, more preferably at least about 90% amino acid sequence identity, more 
preferably at least about 91% amino acid sequence identity, more preferably at least 
about 92% amino acid sequence identity, more preferably at least about 93% amino 
acid sequence identity, more preferably at least about 94% amino acid sequence 
identity, more preferably at least about 95% amino acid sequence identity, more 
preferably at least about 96% amino acid sequence identity, more preferably at least 
about 97% amino acid sequence identity, more preferably at least about 98% amino 
acid sequence identity and yet more preferably at least about 99% amino acid 
sequence identity with either the full-length polypeptide or a specifically derived 
fragment of the amino acid sequence of the protein shown in Table 1. Variant 
polypeptides do not encompass the native polypeptide sequence.

[00443] As used herein with respect to the polypeptide sequences, "percent (%)
amino acid sequence identity" is defined as the percentage of amino acid residues in a
candidate sequence that are identical with the amino acid residues in the sequence of
a protein of the present invention, after aligning the sequences and introducing gaps, if
necessary, to achieve the maximum percent sequence identity, and not considering 
any conservative substitutions as part of the sequence identity. Alignment for 
purposes of determining percent amino acid sequence identity can be achieved in
various ways that are within the skill in the art, for instance, using publicly available
computer software such as BLAST, BLAST-2, ALIGN, ALIGN-2 or Megalign
(DNASTAR) software. Those skilled in the art can determine appropriate parameters
for measuring alignment, including any algorithms needed to achieve maximal
alignment over the full-length of the sequences being compared. For purposes herein, 
however, % amino acid sequence identity values are obtained as described below by 
using the sequence comparison computer program ALIGN-2. The ALIGN-2 sequence 
comparison computer program was authored by Genentech, Inc. and the source code 
has been filed with user documentation in the U.S. Copyright Office, Washington 
D.C., 20559, where it is registered under U.S. Copyright Registration No.
TXU510087. The ALIGN-2 program is publicly available through Genentech, Inc.,
South San Francisco, California. The ALIGN-2 program should be compiled for use
on a UNIX operating system, preferably digital UNIX V4.0D. All sequence
comparison parameters are set by the ALIGN-2 program and do not vary.

[00444] For purposes herein, the % amino acid sequence identity of a given
amino acid sequence A to, with, or against a given amino acid sequence B (which can
alternatively be phrased as a given amino acid sequence A that has or comprises a 
certain % amino acid sequence identity to, with, or against a given amino acid 
sequence B) is calculated as follows: 100 times the fraction X/Y, where X is the
number of amino acid residues scored as identical matches by the sequence alignment 
program ALIGN-2 in that program's alignment of A and B, and where Y is the total 
number of amino acid residues in B. It will be appreciated by one skilled in the art 
that where the length of amino acid sequence A is not equal to the length of amino 
acid sequence B, the % amino acid sequence identity of A to B will not equal the % 
amino acid sequence identity of B to A. 
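
The calculation above can be sketched in Python, assuming the alignment has already been produced and is supplied as two gapped strings of equal length with "-" as the gap character. The function name and the example sequences are hypothetical; the sketch only applies the 100 times X/Y formula and does not reproduce ALIGN-2.

```python
def percent_identity(aligned_a, aligned_b):
    """100 * X / Y for a pairwise alignment given as two gapped strings.

    X: aligned columns where both residues are identical (gaps never match).
    Y: total number of residues in B, i.e. B with gap characters removed.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned strings must have equal length")
    x = sum(1 for a, b in zip(aligned_a, aligned_b) if a == b and a != "-")
    y = sum(1 for b in aligned_b if b != "-")
    return 100.0 * x / y


# Hypothetical alignment: A has 6 residues, B has 8.
aln_a = "MKT-AY-L"
aln_b = "MKTWAYGL"
a_to_b = percent_identity(aln_a, aln_b)  # X = 6, Y = 8
b_to_a = percent_identity(aln_b, aln_a)  # X = 6, Y = 6
print(a_to_b, b_to_a)
```

With these example strings the identity of A to B is 75% while that of B to A is 100%, which shows the asymmetry noted in the text when the two sequences differ in length.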

[00445] Unless specifically stated otherwise, all % amino acid sequence
identity values used herein are obtained as described above using the ALIGN-2
sequence comparison computer program. However, % amino acid sequence identity 
can also be determined using the sequence comparison program NCBI-BLAST2 
(Altschul et al., Nucleic Acids Res. 25:3389-3402 (1997)). The NCBI-BLAST2
sequence comparison program can be downloaded from http://www.ncbi.nlm.nih.gov. 
NCBI-BLAST2 uses several search parameters, wherein all of those search 
parameters are set to default values including, for example, unmask = yes, strand =
all, expected occurrences = 10, minimum low complexity length = 15/5, multi-pass
e-value = 0.01, constant for multi-pass = 25, dropoff for final gapped alignment = 25
and scoring matrix = BLOSUM62. 

[00446] In situations where NCBI-BLAST2 is employed for amino acid
sequence comparisons, the % amino acid sequence identity of a given amino acid
sequence A to, with, or against a given amino acid sequence B (which can 
alternatively be phrased as a given amino acid sequence A that has or comprises a 
certain % amino acid sequence identity to, with, or against a given amino acid 
sequence B) is calculated as follows: 100 times the fraction X/Y, where X is the 
number of amino acid residues scored as identical matches by the sequence alignment 
program NCBI-BLAST2 in that program's alignment of A and B, and where Y is the 
total number of amino acid residues in B. It will be appreciated that where the length 
of amino acid sequence A is not equal to the length of amino acid sequence B, the % 
amino acid sequence identity of A to B will not equal the % amino acid sequence 
identity of B to A. 

[00447] "Variant polynucleotide" or "variant nucleic acid sequence" means a
nucleic acid molecule which encodes an active polypeptide, as defined herein, and
which has at least about 80% nucleic acid sequence identity with either (a) a nucleic 
acid sequence which encodes residues comprising the known reading frame of the 
nucleic acid coding for the polypeptide shown in Table 1, or (b) a nucleic acid 
sequence which encodes another specifically derived fragment of the amino acid 
sequence shown in Table 1. Ordinarily, a variant polynucleotide will have at least
about 80% nucleic acid sequence identity, more preferably at least about 81% nucleic 
acid sequence identity, more preferably at least about 82% nucleic acid sequence 
identity, more preferably at least about 83% nucleic acid sequence identity, more 
preferably at least about 84% nucleic acid sequence identity, more preferably at least
about 85% nucleic acid sequence identity, more preferably at least about 86% nucleic 
acid sequence identity, more preferably at least about 87% nucleic acid sequence 
identity, more preferably at least about 88% nucleic acid sequence identity, more 
preferably at least about 89% nucleic acid sequence identity, more preferably at least 
about 90% nucleic acid sequence identity, more preferably at least about 91% nucleic 
acid sequence identity, more preferably at least about 92% nucleic acid sequence 
identity, more preferably at least about 93% nucleic acid sequence identity, more
preferably at least about 94% nucleic acid sequence identity, more preferably at least
about 95% nucleic acid sequence identity, more preferably at least about 96% nucleic 
acid sequence identity, more preferably at least about 97% nucleic acid sequence 
identity, more preferably at least about 98% nucleic acid sequence identity and yet 
more preferably at least about 99% nucleic acid sequence identity with either (a) a 
nucleic acid sequence which encodes all residues of the polypeptide shown in Table 1,
or (b) a nucleic acid sequence which encodes another specifically derived fragment
of the amino acid sequence shown in Table 1. Polynucleotide variants do not
encompass the native nucleotide sequence. 

[00448] "Percent (%) nucleic acid sequence identity" with respect to the
polypeptide-encoding nucleic acid sequences identified herein is defined as the
percentage of nucleotides in a candidate sequence that are identical with the 
nucleotides in a polypeptide-encoding nucleic acid sequence, after aligning the 
sequences and introducing gaps, if necessary, to achieve the maximum percent 
sequence identity. Alignment for purposes of determining percent nucleic acid 
sequence identity can be achieved in various ways that are within the skill in the art, 
for instance, using publicly available computer software such as BLAST, BLAST-2, 
ALIGN, ALIGN-2 or Megalign (DNASTAR) software. Those skilled in the art can 
determine appropriate parameters for measuring alignment, including any algorithms 
needed to achieve maximal alignment over the full-length of the sequences being 
compared. For purposes herein, however, % nucleic acid sequence identity values are 
obtained as described below by using the sequence comparison computer program 
ALIGN-2. The ALIGN-2 sequence comparison computer program was authored by 
Genentech, Inc., and the source code has been filed with user documentation in the 
U.S. Copyright Office, Washington D.C., 20559, where it is registered under U.S. 
Copyright Registration No. TXU510087. The ALIGN-2 program is publicly available
through Genentech, Inc., South San Francisco, California. The ALIGN-2 program 
should be compiled for use on a UNIX operating system, preferably digital UNIX
V4.0D. All sequence comparison parameters are set by the ALIGN-2 program and do
not vary. 

[00449] For purposes herein, the % nucleic acid sequence identity of a given 
nucleic acid sequence C to, with, or against a given nucleic acid sequence D (which 
can alternatively be phrased as a given nucleic acid sequence C that has or comprises
a certain % nucleic acid sequence identity to, with, or against a given nucleic acid
sequence D) is calculated as follows: 100 times the fraction W/Z, where W is the 
number of nucleotides scored as identical matches by the sequence alignment program 
ALIGN-2 in that program's alignment of C and D, and where Z is the total number of 
nucleotides in D. It will be appreciated that where the length of nucleic acid sequence 
C is not equal to the length of nucleic acid sequence D, the % nucleic acid sequence 
identity of C to D will not equal the % nucleic acid sequence identity of D to C. 
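
As a sketch of the full procedure (align, introduce gaps, then compute 100 times W/Z), the following Python example uses a basic Needleman-Wunsch global alignment as a stand-in for ALIGN-2. The scoring values (match = 1, mismatch = -1, gap = -1) and all function names are assumptions made for illustration; ALIGN-2 sets its own parameters, so values from this sketch need not agree with it.

```python
def global_align(c, d, match=1, mismatch=-1, gap=-1):
    """Needleman-Wunsch global alignment; returns two gapped strings."""
    rows, cols = len(c) + 1, len(d) + 1
    score = [[0] * cols for _ in range(rows)]
    for i in range(1, rows):
        score[i][0] = i * gap
    for j in range(1, cols):
        score[0][j] = j * gap
    for i in range(1, rows):
        for j in range(1, cols):
            step = match if c[i - 1] == d[j - 1] else mismatch
            score[i][j] = max(
                score[i - 1][j - 1] + step,  # align c[i-1] with d[j-1]
                score[i - 1][j] + gap,       # gap in d
                score[i][j - 1] + gap,       # gap in c
            )
    # Trace back from the bottom-right corner to recover one best alignment.
    out_c, out_d = [], []
    i, j = len(c), len(d)
    while i > 0 or j > 0:
        if i > 0 and j > 0:
            step = match if c[i - 1] == d[j - 1] else mismatch
            if score[i][j] == score[i - 1][j - 1] + step:
                out_c.append(c[i - 1])
                out_d.append(d[j - 1])
                i -= 1
                j -= 1
                continue
        if i > 0 and score[i][j] == score[i - 1][j] + gap:
            out_c.append(c[i - 1])
            out_d.append("-")
            i -= 1
        else:
            out_c.append("-")
            out_d.append(d[j - 1])
            j -= 1
    return "".join(reversed(out_c)), "".join(reversed(out_d))


def percent_nucleic_identity(c, d):
    """100 * W / Z: W identical aligned nucleotides, Z nucleotides in D."""
    aligned_c, aligned_d = global_align(c, d)
    w = sum(1 for x, y in zip(aligned_c, aligned_d) if x == y and x != "-")
    return 100.0 * w / len(d)


c_to_d = percent_nucleic_identity("ACGT", "ACGTACGT")
d_to_c = percent_nucleic_identity("ACGTACGT", "ACGT")
print(c_to_d, d_to_c)
```

For these hypothetical sequences, C to D gives 50% and D to C gives 100%, because the denominator Z is the length of whichever sequence is treated as D.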

[00450] Unless specifically stated otherwise, all % nucleic acid sequence
identity values used herein are obtained as described above using the ALIGN-2
sequence comparison computer program. However, % nucleic acid sequence identity
can also be determined using the sequence comparison program NCBI-BLAST2
(Altschul et al., Nucleic Acids Res. 25:3389-3402 (1997)). The NCBI-BLAST2
sequence comparison program can be downloaded from http://www.ncbi.nlm.nih.gov.
NCBI-BLAST2 uses several search parameters, wherein all of those search
parameters are set to default values including, for example, unmask = yes, strand =
all, expected occurrences = 10, minimum low complexity length = 15/5, multi-pass
e-value = 0.01, constant for multi-pass = 25, dropoff for final gapped alignment = 25
and scoring matrix = BLOSUM62. 

[00451] In situations where NCBI-BLAST2 is employed for sequence
comparisons, the % nucleic acid sequence identity of a given nucleic acid sequence C
to, with, or against a given nucleic acid sequence D (which can alternatively be 
phrased as a given nucleic acid sequence C that has or comprises a certain % nucleic 
acid sequence identity to, with, or against a given nucleic acid sequence D) is 
calculated as follows: 100 times the fraction W/Z, where W is the number of 
nucleotides scored as identical matches by the sequence alignment program 
NCBI-BLAST2 in that program's alignment of C and D, and where Z is the total
number of nucleotides in D. It will be appreciated that where the length of nucleic 
acid sequence C is not equal to the length of nucleic acid sequence D, the % nucleic 
acid sequence identity of C to D will not equal the % nucleic acid sequence identity of 
D to C.

[00452] In other embodiments, variant polynucleotides are nucleic acid 
molecules that encode an active polypeptide and which are capable of hybridizing, 
preferably under stringent hybridization and wash conditions, to nucleotide sequences
encoding the full-length polypeptide shown in Table 1. Variant polypeptides can be
those that are encoded by a variant polynucleotide. 

[00453] The term "conservative", in the context of the amino acid sequence 
identity comparisons performed as described above, includes amino acid residues that 
have similar properties. Preferred conservative substitutions are shown in Table 2. 

TABLE 2

Thr (T)
Val (V)

[00454] For purposes herein, the % value of positives of a given amino acid
sequence A to, with, or against a given amino acid sequence B (which can
alternatively be phrased as a given amino acid sequence A that has or comprises a
certain % positives to, with, or against a given amino acid sequence B) is calculated as
follows: 100 times the fraction X/Y, where X is the number of amino acid residues
scoring a positive value as defined above by the sequence alignment program 
ALIGN-2 in that program's alignment of A and B, and where Y is the total number of 
amino acid residues in B. It will be appreciated that where the length of amino acid 
sequence A is not equal to the length of amino acid sequence B, the % positives of A 
to B will not equal the % positives of B to A. 
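
A minimal Python sketch of the % positives calculation follows, again taking a precomputed gapped alignment as input. The residue groups below are a hypothetical stand-in chosen for illustration; they are not Table 2 and not the scoring used by ALIGN-2. The sketch also assumes that identical residues count as positives, as is conventional in BLAST-style output.

```python
# Hypothetical similarity groups used ONLY for illustration; they are not
# Table 2 of this document and not the scoring used by ALIGN-2.
SIMILAR_GROUPS = [
    set("ILVM"),  # aliphatic / hydrophobic
    set("FYW"),   # aromatic
    set("KRH"),   # basic
    set("DE"),    # acidic
    set("NQ"),    # amide
    set("ST"),    # small hydroxyl
]


def is_positive(a, b):
    """True if two residues are identical or share a similarity group."""
    if a == "-" or b == "-":
        return False
    return a == b or any(a in g and b in g for g in SIMILAR_GROUPS)


def percent_positives(aligned_a, aligned_b):
    """100 * X / Y, with X = positive-scoring columns and Y = residues in B."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned strings must have equal length")
    x = sum(1 for a, b in zip(aligned_a, aligned_b) if is_positive(a, b))
    y = sum(1 for b in aligned_b if b != "-")
    return 100.0 * x / y


# Hypothetical alignment: A has 5 residues, B has 6.
pos_a_to_b = percent_positives("MKLDS-", "MRIETG")  # X = 5, Y = 6
pos_b_to_a = percent_positives("MRIETG", "MKLDS-")  # X = 5, Y = 5
print(round(pos_a_to_b, 1), pos_b_to_a)
```

As with percent identity, swapping A and B changes the denominator, so the two values differ whenever the sequences differ in length.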

[00455] The term "isolated," when used to describe the various polypeptides
disclosed herein, means a polypeptide that has been identified and separated and/or
recovered from a component of its natural environment. Preferably, the isolated 
polypeptide is free of association with all components with which it is naturally 
associated. Contaminant components of its natural environment are materials that 
would typically interfere with diagnostic or therapeutic uses for the polypeptide, and 
can include enzymes, hormones, and other proteinaceous or non-proteinaceous 
solutes. In preferred embodiments, the polypeptide will be purified (1) to a degree 
sufficient to obtain at least 15 residues of N-terminal or internal amino acid sequence 
by use of a spinning cup sequenator, or (2) to homogeneity by SDS-PAGE under non- 
reducing or reducing conditions using Coomassie blue or, preferably, silver stain. 
Isolated polypeptide includes polypeptide in situ within recombinant cells, since at 
least one component of the protein's natural environment will not be present.
Ordinarily, however, isolated polypeptide will be prepared by at least one purification 
step. 

[00456] An "isolated" nucleic acid molecule encoding a polypeptide is a
nucleic acid molecule that is identified and separated from at least one contaminant
nucleic acid molecule with which it is ordinarily associated in the natural source of
the nucleic acid. Preferably, the isolated nucleic acid is free of association with all
components with which it is naturally associated. An isolated nucleic acid molecule is 
one that is other than in the form or setting in which it is found in nature. Isolated 
nucleic acid molecules therefore are distinguished from nucleic acid molecules as they 
exist in natural cells. However, an isolated nucleic acid molecule encoding a 
polypeptide includes nucleic acid molecules contained in cells that ordinarily express
polypeptides where, for example, the nucleic acid molecule is in a chromosomal
location different from that of natural cells. 

[00457] The term "control sequences" refers to DNA sequences necessary for
the expression of an operably linked coding sequence in a particular host organism.
The control sequences that are suitable for prokaryotes, for example, include a 
promoter, optionally an operator sequence, and a ribosome-binding site. Additionally, 
eukaryotic cells are known to utilize promoters, polyadenylation signals, and 
enhancers. 

[00458] A nucleic acid is "operably linked" when it is placed into a functional
relationship with another nucleic acid sequence. For example, DNA for a presequence
or secretory leader is operably linked to DNA for a polypeptide if it is expressed as a 
preprotein that participates in the secretion of the polypeptide; a promoter or enhancer 
is operably linked to a coding sequence if it affects the transcription of the sequence; 
or a ribosome binding site is operably linked to a coding sequence if it is positioned so 
as to facilitate translation. Generally, "operably linked" means that the DNA 
sequences being linked are contiguous, and, in the case of a secretory leader, 
contiguous and in reading phase. However, enhancers do not have to be contiguous. 
Linking is accomplished by ligation at convenient restriction sites. If such sites do not 
exist, the synthetic oligonucleotide adaptors or linkers are used in accordance with 
conventional practice. 

[00459] The term "antibody" is used in the broadest sense and specifically
covers, for example, single monoclonal antibodies (including agonist, antagonist, and
neutralizing antibodies), antibody compositions with polyepitopic specificity, single 
chain antibodies, and fragments of antibodies. The term "monoclonal antibody" as 
used herein refers to an antibody obtained from a population of substantially 
homogeneous antibodies, i.e., the individual antibodies comprising the population are
identical except for possible naturally occurring mutations that can be present in
minor amounts.

The "stringency" of hybridization reactions is readily determinable by
one of ordinary skill in the art, and generally is an empirical calculation dependent
upon probe length, washing temperature, and salt concentration. In general, longer 
probes require higher temperatures for proper annealing, while shorter probes need 
lower temperatures. Hybridization generally depends on the ability of denatured DNA 
to reanneal when complementary strands are present in an environment below their
melting temperature. The higher the degree of desired homology between the probe
and hybridizable sequence, the higher the relative temperature that can be used. As
a result, it follows that higher relative temperatures would tend to make the reaction 
conditions more stringent, while lower temperatures less so. For additional details and 
explanation of stringency of hybridization reactions, see Ausubel et al., Current
Protocols in Molecular Biology, Wiley Interscience Publishers, (1995). 
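
The dependence of annealing temperature on probe length can be illustrated with the Wallace rule, a common rule of thumb for short oligonucleotide probes (roughly 14 to 20 nucleotides) that estimates the melting temperature as 2 °C per A or T plus 4 °C per G or C. This rule is not taken from the present document and ignores salt concentration; the function name and example probes are hypothetical.

```python
def wallace_tm(probe):
    """Estimate melting temperature (deg C) of a short oligonucleotide.

    Wallace rule: Tm = 2 * (A + T) + 4 * (G + C). This is a rough estimate
    for short probes only and does not account for salt or formamide.
    """
    probe = probe.upper()
    at = probe.count("A") + probe.count("T")
    gc = probe.count("G") + probe.count("C")
    if at + gc != len(probe):
        raise ValueError("probe must contain only A, C, G and T")
    return 2 * at + 4 * gc


short_tm = wallace_tm("ACGTACGTACGTAC")        # 14-mer, 7 A/T and 7 G/C
long_tm = wallace_tm("ACGTACGTACGTACGTACGT")   # 20-mer, 10 A/T and 10 G/C
print(short_tm, long_tm)
```

For these two probes the estimate rises from 42 °C to 60 °C as the probe grows from 14 to 20 nucleotides, consistent with the statement that longer probes require higher temperatures for proper annealing.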

[00460] "Stringent conditions" or "high stringency conditions", as defined
herein, can be identified by those that: (1) employ low ionic strength and high
temperature for washing, for example 0.015 M sodium chloride/0.0015 M sodium
citrate/0.1% sodium dodecyl sulfate at 50°C; (2) employ during hybridization a
denaturing agent, such as formamide, for example, 50% (v/v) formamide with 0.1%
bovine serum albumin/0.1% Ficoll/0.1% polyvinylpyrrolidone/50 mM sodium
phosphate buffer at pH 6.5 with 750 mM sodium chloride, 75 mM sodium citrate at
42°C; or (3) employ 50% formamide, 5 x SSC (0.75 M NaCl, 0.075 M sodium
citrate), 50 mM sodium phosphate (pH 6.8), 0.1% sodium pyrophosphate, 5 x
Denhardt's solution, sonicated salmon sperm DNA (50 µg/ml), 0.1% SDS, and 10%
dextran sulfate at 42°C, with washes at 42°C in 0.2 x SSC (sodium chloride/sodium
citrate) and 50% formamide at 55°C, followed by a high-stringency wash consisting
of 0.1 X SSC containing EDTA at 55°C. 
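
The SSC strengths quoted above follow from a single stock definition. The paragraph gives 5 x SSC as 0.75 M NaCl and 0.075 M sodium citrate, which implies 1 x SSC of 0.15 M NaCl and 0.015 M sodium citrate. The short sketch below converts any fold-strength into molar concentrations; the function and constant names are hypothetical and introduced only for illustration.

```python
# 1 x SSC composition implied by "5 x SSC (0.75 M NaCl, 0.075 M sodium citrate)".
SSC_1X_NACL_M = 0.15
SSC_1X_CITRATE_M = 0.015


def ssc_molarity(fold):
    """Return (NaCl molarity, sodium citrate molarity) for a given SSC strength."""
    if fold < 0:
        raise ValueError("fold strength must be non-negative")
    return fold * SSC_1X_NACL_M, fold * SSC_1X_CITRATE_M


if __name__ == "__main__":
    for fold in (5, 1, 0.2, 0.1):
        nacl, citrate = ssc_molarity(fold)
        print(f"{fold} x SSC: {nacl:.4f} M NaCl, {citrate:.4f} M sodium citrate")
```

On this basis the wash in condition (1), 0.015 M sodium chloride/0.0015 M sodium citrate, corresponds to 0.1 x SSC, and the 750 mM sodium chloride/75 mM sodium citrate of condition (2) corresponds to 5 x SSC.
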

[00461] "Active" or "activity" for the purposes herein refers to form(s) of 

polypeptide which retain a biological and/or an immunological activity of native or 
naturally-occurring polypeptide, wherein "biological" activity refers to a biological 
function (either inhibitory or stimulatory), which includes enzymatic activity, caused 
by a native or naturally-occurring polypeptide other than the ability to induce the 
production of an antibody against an antigenic epitope possessed by a native or 
naturally-occurring polypeptide and an "immunological" activity refers to the ability 
to induce the production of an antibody against an antigenic epitope possessed by a 
native or naturally-occurring polypeptide. A preferred biological activity includes, for
example, the property of the polypeptide to degrade extracellular matrix, as in the
case of the proteases described in this invention that were discovered to do so.
[00462] The term "antagonist" is used in the broadest sense, and includes any 

molecule that partially or fully blocks, inhibits, or neutralizes a biological activity of a 
native polypeptide disclosed herein. In a similar manner, the term "agonist" is used in
the broadest sense and includes any molecule that mimics a biological activity of a
native polypeptide disclosed herein. Suitable agonist or antagonist molecules 
specifically include agonist or antagonist antibodies or antibody fragments, fragments 
or amino acid sequence variants of native polypeptides, peptides, antisense molecules, 
and small organic molecules. Methods for identifying agonists or antagonists of a 
polypeptide include contacting a polypeptide, mRNA or gene with a candidate agonist 
or antagonist molecule and measuring a detectable change in one or more biological 
activities normally associated with the polypeptide. 

[00463] "Treatment" refers to both therapeutic treatment and prophylactic or 

preventative measures, wherein the object is to prevent or slow down (lessen) the 
targeted pathologic condition or disorder. More specifically, "treatment" is an 
intervention performed with the intention of preventing the development or altering 
the pathology of a cancerous disorder. The concept of treatment is used in the 
broadest sense, and specifically includes the prevention (prophylaxis), moderation, 
reduction, and curing of cancerous disorders of any stage. Accordingly, "treatment" 
refers to both therapeutic treatment and prophylactic or preventative measures, 
wherein the object is to prevent or slow down (lessen) a cancerous disorder. The 
disorder may result from any cause. Subjects in need of treatment include those 
already with the disorder as well as those susceptible to the disorder or those in whom 
the disorder needs to be prevented. 

[00464] "Chronic" administration refers to administration of the agent(s) in a 

continuous mode as opposed to an acute mode, so as to maintain the initial therapeutic
effect (activity) for an extended period of time.

[00465] "Intermittent" administration is treatment that is not consecutively done 

without interruption, but rather is cyclic in nature. 

[00466] "Microarray" refers to an array of distinct polynucleotides or 

oligonucleotides arranged on a substrate such as paper, nylon, or other type of 
membrane, filter, gel, polymer, chip, glass slide, or any other suitable support, 
including solid supports. The polynucleotides or oligonucleotides (the backbone 
chemistry can be any available in the art) can be synthesized on a substrate or 
prepared before application to the substrate.

[00467] Administration "in combination with" one or more further therapeutic

agents includes both simultaneous (concurrent) and consecutive administration in any 
order of one or more therapeutic agents. 

[00468] "Carriers" as used herein include pharmaceutically acceptable carriers, 

excipients, or stabilizers, which are nontoxic to the cell or mammal being exposed 
thereto at the dosages and concentrations employed. Often the physiologically
acceptable carrier is an aqueous pH buffered solution. Examples of physiologically
acceptable carriers include buffers such as phosphate, citrate, and other organic acids;
antioxidants including ascorbic acid; low molecular weight (i.e., less than about 10
residues) polypeptide; proteins, such as serum albumin, gelatin, or immunoglobulins;
hydrophilic polymers such as polyvinylpyrrolidone; amino acids such as glycine,
glutamine, asparagine, arginine or lysine; monosaccharides, disaccharides, and other
carbohydrates including glucose, mannose, or dextrins; chelating agents such as
EDTA; sugar alcohols such as mannitol or sorbitol; lipids such as cationic lipids; salt-
forming counterions such as sodium; and/or nonionic surfactants such as TWEEN™,
polyethylene glycol (PEG), and PLURONICS™.

[00469] "Antibody fragments" comprise a portion of an intact antibody, 

preferably the antigen binding or variable region of the intact antibody. Examples of 
antibody fragments include Fab, Fab', F(ab')2, and Fv fragments; diabodies; linear
antibodies (see Zapata et al., Protein Eng. 8(10): 1057-1062 (1995)); single-chain
antibody molecules; and multispecific antibodies formed from antibody fragments. 
[00470] Papain digestion of antibodies produces two identical antigen-binding 

fragments, called "Fab" fragments, each with a single antigen-binding site, and a 
residual "Fc" fragment, a designation reflecting the ability to crystallize readily. Pepsin
treatment yields an F(ab')2 fragment that has two antigen-combining sites and is still 
capable of cross-linking antigen. 

[00471] "Fv" is the minimum antibody fragment that contains a complete 

antigen-recognition and antigen-binding site. This region consists of a dimer of one 
heavy- and one light-chain variable domain in tight, non-covalent association. It is in 
this configuration that the three CDRs of each variable domain interact to define an 
antigen-binding site on the surface of the VH-VL dimer. Collectively, the six CDRs 
confer antigen-binding specificity to the antibody. However, even a single variable 
domain (or half of an Fv comprising only three CDRs specific for an antigen) has the
ability to recognize and bind antigen, although at a lower affinity than the entire
binding site. 

[00472] The Fab fragment also contains the constant domain of the light chain 
and the first constant domain (CH1) of the heavy chain. Fab' fragments differ from
Fab fragments by the addition of a few residues at the carboxy terminus of the heavy
chain CH1 domain including one or more cysteines from the antibody hinge region.
Fab'-SH is the designation herein for Fab' in which the cysteine residue(s) of the
constant domains bear a free thiol group. F(ab')2 antibody fragments originally were
produced as pairs of Fab' fragments which have hinge cysteines between them. Other
chemical couplings of antibody fragments are also known to one of ordinary skill in 
the art. 

[00473] The "light chains" of antibodies (immunoglobulins) from any 

vertebrate species can be assigned to one of two clearly distinct types, called kappa 
and lambda, based on the amino acid sequences of their constant domains. 
[00474] Depending on the amino acid sequence of the constant domain of their 

heavy chains, immunoglobulins can be assigned to different classes. There are five 
major classes of immunoglobulins: IgA, IgD, IgE, IgG, and IgM, and several of these 
can be further divided into subclasses (isotypes), e.g., IgG1, IgG2, IgG3, IgG4, IgA1,
and IgA2.

[00475] "Single-chain Fv" or "sFv" antibody fragments comprise the VH and 

VL domains of antibody, wherein these domains are present in a single polypeptide 
chain. Preferably, the Fv polypeptide further comprises a polypeptide linker between 
the VH and VL domains, which enables the sFv to form the desired structure for 
antigen binding. For a review of sFv, see Pluckthun in The Pharmacology of 
Monoclonal Antibodies, vol. 113, Rosenburg and Moore eds., Springer-Verlag, New
York, pp. 269-315 (1994).

[00476] The term "diabodies" refers to small antibody fragments with two 
antigen-binding sites, which fragments comprise a heavy-chain variable domain (VH) 
connected to a light chain variable domain (VL) in the same polypeptide chain (VH - 
VL). By using a linker that is too short to allow pairing between the two domains on 
the same chain, the domains are forced to pair with the complementary domains of 
another chain and create two antigen-binding sites. Diabodies are described more
fully in, for example, EP 404,097; WO 93/11161; and Hollinger et al., Proc. Natl.
Acad. Sci. USA, 90:6444-6448 (1993).

[00477] An "isolated" antibody is one that has been identified and separated 

and/or recovered from a component of its natural environment. Contaminant 
components of its natural environment are materials that would interfere with
diagnostic or therapeutic uses for the antibody, and can include enzymes, hormones, 
and other proteinaceous or non-proteinaceous solutes. In preferred embodiments, the 
antibody will be purified (1) to greater than 95% by weight of antibody as determined 
by the Lowry method, and most preferably more than 99% by weight, (2) to a degree 
sufficient to obtain at least 15 residues of N-terminal or internal amino acid sequence 
by use of a spinning cup sequenator, or (3) to homogeneity by SDS-PAGE under 
reducing or nonreducing conditions using Coomassie blue or, preferably, silver stain. 
Isolated antibody includes the antibody in situ within recombinant cells since at least 
one component of the antibody's natural environment will not be present. Ordinarily, 
however, isolated antibody will be prepared by at least one purification step. 
[00478] The word "label", when used herein, refers to a detectable compound 
or composition, which is conjugated directly or indirectly to the antibody so as to 
generate a "labeled" antibody. The label can be detectable by itself (e.g., radioisotope
labels or fluorescent labels) or, in the case of an enzymatic label, can catalyze
chemical alteration of a substrate compound or composition, which is detectable.
[00479] By "solid phase" is meant a non-aqueous matrix to which the antibody 

of the present invention can adhere. Examples of solid phases encompassed herein 
include those formed partially or entirely of glass (e.g., controlled pore glass), 
polysaccharides (e.g., agarose), polyacrylamides, polystyrene, polyvinyl alcohol and 
silicones. In certain embodiments, depending on the context, the solid phase can 
comprise the well of an assay plate; in others it is a purification column (e.g., an
affinity chromatography column). This term also includes a discontinuous solid phase 
of discrete particles, such as those described in U.S. Patent No. 4,275,149. 
[00480] As used herein, the terms "cancer" or "cancerous" refer to or describe 
the physiological conditions in mammals that are typically characterized by 
unregulated cell growth. This can include benign growth, pre-malignant growth or 
malignant growth wherein the cells of the primary growth have spread to other sites. 
As used herein, the terms include any neoplasia, described as "an abnormal mass of
tissue, the growth of which exceeds and is uncoordinated with that of the normal
tissues and persists in the same excessive manner after cessation of the stimuli which 
evoke the change." (referenced in Robbins, S. L. Pathologic Basis of Disease, W. B. 
Saunders Co. 1974). Examples of cancers include but are not limited to, carcinomas 
including adenocarcinoma, lymphoma, blastoma, melanoma, sarcoma, and leukemia. 
More particular examples of such cancers include squamous cell cancer, small-cell 
lung cancer, non-small cell lung cancer, gastrointestinal cancer, Hodgkin's and non-
Hodgkin's lymphoma, pancreatic cancer, glioblastoma, cervical cancer, ovarian
cancer, liver cancers such as hepatic carcinoma and hepatoma, bladder cancer, breast 
cancer, colon cancer, colorectal cancer, endometrial carcinoma, salivary gland 
carcinoma, kidney cancer such as renal cell carcinoma and Wilms' tumors, basal cell
carcinoma, melanoma, prostate cancer, vulval cancer, thyroid cancer, testicular 
cancer, esophageal cancer, and various types of head and neck cancer. The preferred 
cancers for treatment according to the methods of the invention described herein are 
breast, colon, lung, melanoma, ovarian, and prostate cancer. 

[00481] The term "cytotoxic agent" as used herein refers to a substance that 

inhibits or prevents the function of cells and/or causes destruction of cells. The term is 
intended to include radioactive isotopes (e.g., ¹³¹I, ¹²⁵I, ⁹⁰Y, and ¹⁸⁶Re),
chemotherapeutic agents, and toxins such as enzymatically active toxins of bacterial, 
fungal, plant, or animal origin, or fragments thereof. 

[00482] A "chemotherapeutic agent" is a chemical compound useful in the 

treatment of cancer. Examples of chemotherapeutic agents include alkylating agents, 
folic acid antagonists, anti-metabolites of nucleic acid metabolism, antibiotics, 
pyrimidine analogs, 5-fluorouracil, cisplatin, purine nucleosides, amines, amino acids,
triazole nucleosides, or corticosteroids. Specific examples include Adriamycin,
Doxorubicin, 5-Fluorouracil, Cytosine arabinoside ("Ara-C"), Cyclophosphamide,
Thiotepa, Busulfan, Cytoxin, Taxol, Taxotere, Methotrexate, Cisplatin, Melphalan,
Vinblastine, Bleomycin, Etoposide, Ifosfamide, Mitomycin C, Mitoxantrone, 
Vinblastine, Vinorelbine, Carboplatin, Teniposide, Daunomycin, Carminomycin, 
Aminopterin, Dactinomycin, Mitomycins, Esperamicins (See U.S. Pat. No. 
4,675,187), Melphalan, and other related nitrogen mustards. Also included in this 
definition are hormonal agents that act to regulate or inhibit hormone action on 
tumors, such as tamoxifen and onapristone.

[00483] "Growth-inhibitory agent" when used herein refers to a compound or

composition that inhibits growth of a cell, such as a Wnt-overexpressing cancer cell, 
either in vitro or in vivo. Thus, a growth-inhibitory agent is one that significantly 
reduces the percentage of malignant cells in S phase. Examples of growth-inhibitory 
agents include agents that block cell cycle progression (at a place other than S phase), 
such as agents that induce G1 arrest and M-phase arrest. Classical M-phase blockers
include the vincas (vincristine and vinblastine), taxol, and topo II inhibitors such as
doxorubicin, daunorubicin, etoposide, and bleomycin. Those agents that arrest G1
also spill over into S-phase arrest, for example, DNA alkylating agents such as
tamoxifen, prednisone, dacarbazine, mechlorethamine, cisplatin, methotrexate, 5-
fluorouracil, and ara-C. Further information on growth-inhibitory agents can be found
in The Molecular Basis of Cancer, Mendelsohn and Israel, eds., Chapter 1, entitled
"Cell cycle regulation, oncogenes, and antineoplastic drugs" by Murakami et al.,
(WB Saunders: Philadelphia, 1995), p. 13. Additional examples include tumor
necrosis factor (TNF), an antibody capable of inhibiting or neutralizing the angiogenic 
activity of acidic or basic FGF or hepatocyte growth factor (HGF), an antibody 
capable of inhibiting or neutralizing the coagulant activities of tissue factor, protein C, 
or protein S (see, WO 91/01753, published 21 February 1991), or an antibody capable 
of binding to HER2 receptor (WO 89/06692), such as the 4D5 antibody (and 
functional equivalents thereof) (e.g., WO 92/22653). 

[00484] In a pharmacological sense, in the context of the present invention, a 

"therapeutically effective amount" of an active agent such as a polypeptide or agonist 
or antagonist thereto or an antibody, refers to an amount effective in the treatment of a 
cancerous disorder in a mammal and can be determined empirically. Determination of 
a therapeutically effective amount can be accomplished by any method known to 
those skilled in the art.

[00485] As used herein, an "effective amount" of an active agent such as a 
polypeptide or agonist or antagonist thereto or an antibody, refers to an amount 
effective for carrying out a stated purpose, wherein such amounts may be determined 
empirically for the desired effect. 

[00486] The herein described nucleic acids, polypeptides, antibodies, agonists, 
and antagonists, when used therapeutically are referred to herein as "Therapeutics". 
Methods of administration of therapeutics include, but are not limited to, intradermal,
intramuscular, intraperitoneal, intravenous, subcutaneous, intranasal, epidural, and
oral routes. The therapeutics of the present invention may be administered by any 
convenient route, for example by infusion or bolus injection, by absorption through 
epithelial or mucocutaneous linings (e.g., oral mucosal, rectal and intestinal mucosal, 
etc.) and may be administered together with other biologically-active agents. 
Administration can be systemic or local. In addition, it may be advantageous to 
administer the therapeutic into the central nervous system by any suitable route, 
including intraventricular and intrathecal injection. 

[00487] Intraventricular injection may be facilitated by an intraventricular 

catheter attached to a reservoir (e.g., an Ommaya reservoir). Pulmonary 
administration may also be employed by use of an inhaler or nebulizer, and 
formulation with an aerosolizing agent. It may also be desirable to administer the 
therapeutic locally to the area in need of treatment; this may be achieved by, for 
example, and not by way of limitation, local infusion during surgery, topical 
application, by injection, by means of a catheter, by means of a suppository, or by 
means of an implant. In a specific embodiment, administration may be by direct 
injection at the site (or former site) of a malignant tumor or neoplastic or pre- 
neoplastic tissue. 

[00488] Various delivery systems are known and can be used to administer a 

therapeutic of the present invention including, e.g.: (i) encapsulation in liposomes,
microparticles, microcapsules; (ii) recombinant cells capable of expressing the 
therapeutic; (iii) receptor mediated endocytosis (See, e.g., Wu and Wu, 1987. J Biol 
Chem 262:4429-4432); (iv) construction of a therapeutic nucleic acid as part of a 
retroviral or other vector, and the like. 

[00489] In one embodiment of the present invention, the therapeutic may be 

delivered in a vesicle, in particular a liposome. In a liposome, the protein of the 
present invention is combined, in addition to other pharmaceutically acceptable 
carriers, with amphipathic agents such as lipids, which exist in aggregated form as 
micelles, insoluble monolayers, liquid crystals, or lamellar layers in aqueous solution. 
Suitable lipids for liposomal formulation include, without limitation, monoglycerides, 
diglycerides, sulfatides, lysolecithin, phospholipids, saponin, bile acids, and the like. 
Preparation of such liposomal formulations is within the level of skill in the art, as
disclosed, for example, in U.S. Pat. No. 4,837,028; and U.S. Pat. No. 4,737,323, all of
which are incorporated herein by reference. 

[00490] In yet another embodiment, the therapeutic can be delivered in a 

controlled release system including, e.g.: a delivery pump (See, e.g., Saudek, et al.,
1989. New Engl J Med 321:574) and a semi-permeable polymeric material (See, e.g.,
Howard, et al., 1989. J Neurosurg 71:105). Additionally, the controlled release
system can be placed in proximity of the therapeutic target (e.g., the brain), thus
requiring only a fraction of the systemic dose. See, e.g., Goodson, In: Medical
Applications of Controlled Release 1984. (CRC Press, Boca Raton, FL).
[00491] In a specific embodiment of the present invention, where the 

therapeutic is a nucleic acid encoding a protein, the therapeutic nucleic acid may be 
administered in vivo to promote expression of its encoded protein, by constructing it 
as part of an appropriate nucleic acid expression vector and administering it so that it 
becomes intracellular (e.g., by use of a retroviral vector, by direct injection, by use of
microparticle bombardment, by coating with lipids or cell-surface receptors or
transfecting agents, or by administering it in linkage to a homeobox-like peptide
which is known to enter the nucleus (See, e.g., Joliot, et al., 1991. Proc Natl Acad Sci
USA 88:1864-1868), and the like). Alternatively, a nucleic acid therapeutic can be
introduced intracellularly and incorporated within host cell DNA for expression, by 
homologous recombination. 

[00492] As used herein, the term "therapeutically effective amount" means the 

total amount of each active component of the pharmaceutical composition or method 
that is sufficient to show a meaningful patient benefit, i.e., treatment, healing,
prevention or amelioration of the relevant medical condition, or an increase in rate of 
treatment, healing, prevention or amelioration of such conditions. When applied to an 
individual active ingredient, administered alone, the term refers to that ingredient 
alone. When applied to a combination, the term refers to combined amounts of the 
active ingredients that result in the therapeutic effect, whether administered in 
combination, serially or simultaneously.

[00493] The amount of the therapeutic of the invention which will be effective 

in the treatment of a particular disorder or condition will depend on the nature of the 
disorder or condition, and may be determined by standard clinical techniques by those 
of average skill within the art. In addition, in vitro assays may optionally be employed
to help identify optimal dosage ranges. The precise dose to be employed in the
formulation will also depend on the route of administration, and the overall
seriousness of the disease or disorder, and should be decided according to the
judgment of the practitioner and each patient's circumstances. Ultimately, the
attending physician will decide the amount of protein of the present invention with 
which to treat each individual patient. Initially, the attending physician will administer 
low doses of protein of the present invention and observe the patient's response. 
Larger doses of protein of the present invention may be administered until the optimal 
therapeutic effect is obtained for the patient, and at that point the dosage is not 
increased further. However, suitable dosage ranges for intravenous administration of 
the therapeutics of the present invention are generally about 20-500 micrograms (µg)
of active compound per kilogram (kg) body weight. Suitable dosage ranges for
intranasal administration are generally about 0.01 pg/kg body weight to 1 mg/kg body 
weight. Effective doses may be extrapolated from dose-response curves derived from 
in vitro or animal model test systems. Suppositories generally contain active 
ingredient in the range of 0.5% to 10% by weight; oral formulations preferably
contain 10% to 95% active ingredient. 

[00494] The duration of intravenous therapy using the pharmaceutical 

composition of the present invention will vary, depending on the severity of the 
disease being treated and the condition and potential idiosyncratic response of each 
individual patient. It is contemplated that the duration of each application of the 
protein of the present invention will be in the range of 12 to 24 hours of continuous 
intravenous administration. Ultimately the attending physician will decide on the 
appropriate duration of intravenous therapy using the pharmaceutical composition of 
the present invention. 

[00495] Unless otherwise defined, all technical and scientific terms used herein 

have the same meaning as commonly understood by one of ordinary skill in the art to 
which this invention pertains. In case of conflict, the present specification, including 
definitions, will control. In addition, the materials, methods, and examples are 
illustrative only and are not intended to be limiting. Although methods and materials 
similar or equivalent to those described herein can be used in the practice or testing of 
the present invention, suitable methods and materials are described below.

[00496] All commercially available reagents referred to in the examples were

used according to manufacturer's instructions unless otherwise indicated. The source 
of those cells identified by ATCC accession numbers in the following examples, and 
throughout the specification, is the American Type Culture Collection, Manassas, VA. 
[00497] All publications, patent applications, patents, and other references 
mentioned herein are incorporated by reference in their entirety. 
[00498] The invention will be further described in the following examples, 
which do not limit the scope of the invention described in the claims. The following 
examples are offered for illustrative purposes only, and are not intended to limit the 
scope of the present invention in any way. 

EXAMPLES 

EXAMPLE 1
Sample Preparation 

[00499] Total RNA was isolated from 10 primary colon tumors and 10 normal 

colon samples using the Ambion Totally RNA kit for isolation of total cellular RNA, 
catalog number #1910, 2130 Woodward Street, Austin, Texas 78744-1832. 
According to the protocol, the samples were lysed in a guanidinium-based lysis
solution and were then extracted sequentially with Phenol:Chloroform:IAA and
Acid-Phenol:Chloroform. The RNA was then precipitated with isopropanol. Poly A+
RNA was extracted using the Oligotex mRNA midi kit, catalog number #70042, 
28159 Avenue Stanford, Valencia, California, 91355. Using this kit the poly A+ 
RNA was purified by hybridizing the poly A+ tail to a dT oligomer coupled to a solid- 
phase matrix. 

EXAMPLE 2 
Probe Generation 

[00500] Each polyA+ RNA sample was reverse transcribed using MMLV
reverse-transcriptase, 0.05 pg/µl oligo-dT primer (21mer), 1x first strand buffer, 0.03
units/µl RNase inhibitor, 500 µM dATP, 500 µM dGTP, 500 µM dTTP, 40 µM dCTP,
40 µM dCTP-Cy3 (BDS) or dCTP-Cy5 (Amersham Pharmacia Biotech). The reverse
transcription reaction was performed in a 25 µl volume containing 200 ng polyA+
RNA with GEMBRIGHT kits. Specific control polyA+ RNAs (YCFR06, YCFR45,
YCFR67, YCFR85, YCFR43, YCFR22, YCFR23, YCFR25, YCFR44, YCFR26)
were synthesized by in vitro transcription from non-coding yeast genomic DNA (W.
Lei, unpublished). As quantitative controls, the control mRNAs (YCFR06, YCFR45,
YCFR67, YCFR85) at 0.002 ng, 0.02 ng, 0.2 ng, and 2 ng were diluted into reverse
transcription reaction at ratios of 1:100,000, 1:10,000, 1:1000, 1:100 (w/w) to sample
mRNA respectively. The control mRNAs (YCFR43, YCFR22, YCFR23, YCFR25,
YCFR44, YCFR26) were diluted into reverse transcription reaction at ratios of 1:3,
3:1, 1:10, 10:1, 1:25, 25:1 (w/w) to sample mRNA differential expression patterns.
After incubation at 37°C for 2 hr, each reaction sample (one with Cy3 and another
with Cy5 labeling) was treated with 2.5 µl of 0.5 M sodium hydroxide and incubated
for 20 minutes at 85°C to stop the reaction and degrade the RNA. Probes were
purified using two successive CHROMA SPIN 30 gel filtration spin columns
(Clontech, Palo Alto, Calif. USA) and after combining, both reaction samples were
ethanol precipitated using 1 µl of glycogen (1 mg/ml), 60 µl sodium acetate, and 300
µl of 100% ethanol. The probe was then dried to completion using a SpeedVAC
(Savant) and resuspended in 14 µl 5xSSC/0.2% SDS.
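
The quantitative control ratios stated above are consistent with the 200 ng of sample polyA+ RNA used per reaction. The following sketch simply checks that arithmetic; the function name is hypothetical and introduced only for illustration.

```python
SAMPLE_MRNA_NG = 200.0  # polyA+ RNA per reverse transcription reaction (from the text)
CONTROL_MASSES_NG = [0.002, 0.02, 0.2, 2.0]  # quantitative control amounts (from the text)


def spike_in_ratio(control_ng, sample_ng=SAMPLE_MRNA_NG):
    """Return N such that the control:sample weight ratio is 1:N."""
    if control_ng <= 0:
        raise ValueError("control mass must be positive")
    return round(sample_ng / control_ng)


if __name__ == "__main__":
    for mass in CONTROL_MASSES_NG:
        print(f"{mass} ng control -> 1:{spike_in_ratio(mass):,} (w/w)")
```
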

EXAMPLE 3 
Hybridization

[00501] Hybridization reactions contained 9 µl of probe mixture consisting of

0.2 µg each of both Cy3 and Cy5 labeled cDNA synthesis products in 5xSSC, 0.2%
SDS hybridization buffer. The probe mixture was heated to 65°C for 5 minutes and
was aliquoted onto the microarray surface and covered with a 1.8 cm² coverslip. The
arrays were transferred to a waterproof chamber having a cavity just slightly larger
than a microscope slide. The chamber was kept at 100% humidity internally by the
addition of 140 µl of 5xSSC in a corner of the chamber. The chamber containing the
arrays was incubated for about 6.5 hours at 60°C. The arrays were washed for 10 min
at 45°C in high stringency wash buffer (1xSSC, 0.1% SDS), three times for 10
minutes each at 45°C in low stringency wash buffer (0.1xSSC), and then dried.

EXAMPLE 4
Detection 

[00502] The microscope used to detect the reporter-labeled hybridization 

complexes was equipped with an Innova 70 mixed gas 10 W laser (Coherent Lasers, 
Santa Clara, Calif.) capable of generating spectral lines at 488 nm for excitation of 
Cy3, and 632 nm for excitation of Cy5. The excitation laser light was focused on the 
array using a 20x microscope objective (Nikon). The slide containing the array was
placed on a computer-controlled X-Y stage on the microscope and raster-scanned past
the objective. The 1.8 cm x 1.8 cm array used in the present example was scanned with
a resolution of 20 micrometers. 
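
As a rough illustration of what these scan dimensions imply, the sketch below computes the size of the resulting image, assuming that the stated 20 micrometer resolution corresponds to one pixel per 20 micrometers in each direction. That interpretation and the function name are assumptions, not statements from the example.

```python
def pixels_per_side(array_side_cm, pixel_size_um):
    """Number of scan pixels along one side of a square array."""
    if array_side_cm <= 0 or pixel_size_um <= 0:
        raise ValueError("dimensions must be positive")
    side_um = array_side_cm * 10_000  # 1 cm = 10,000 micrometers
    return round(side_um / pixel_size_um)


if __name__ == "__main__":
    side = pixels_per_side(1.8, 20)
    print(f"{side} x {side} pixels, {side * side:,} pixels per scan")
```
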

[00503] In two separate scans, a mixed gas multiline laser excited the two 

fluorophores sequentially. Emitted light was split, based on wavelength, into two 
photomultiplier tube detectors (PMT R1477, Hamamatsu Photonics, San Jose, Calif.) 
corresponding to the two fluorophores. Appropriate filters positioned between the 
array and the photomultiplier tubes were used to filter the signals. The emission 
maxima of the fluorophores used were 565 nm for Cy3 and 650 nm for Cy5. Each 
array was typically scanned twice, one scan per fluorophore using the appropriate 
filters at the laser source, although the apparatus was capable of recording the spectra 
from both fluorophores simultaneously. 

[00504] The sensitivity of the scans was typically calibrated using the signal 

intensity generated by a cDNA control species added to the probe mix at a known 
concentration. A specific location on the array contained a complementary DNA 
sequence, allowing the intensity of the signal at that location to be correlated with a 
weight ratio of hybridizing species of 1:100,000. When two probes from different 
sources (e.g., representing test and control cells), each labeled with a different 
fluorophore, are hybridized to a single array for the purpose of identifying genes that 
are differentially expressed, the calibration was done by labeling samples of the 
calibrating cDNA with the two fluorophores and adding identical amounts of each to 
the hybridization mixture. 

[00505] The output of the photomultiplier tube was digitized using a 12-bit 

RTI-835H analog-to-digital (A/D) conversion board (Analog Devices, Norwood,
Mass.) installed in an IBM-compatible PC computer. The digitized data were
displayed as an image where the signal intensity was mapped using a linear 20-color
transformation to a pseudocolor scale ranging from blue (low signal) to red (high
signal). The data were also analyzed quantitatively. Where two different fluorophores
were excited and measured simultaneously, the data were first corrected for optical 
crosstalk (due to overlapping emission spectra) between the fluorophores using each 
fluorophore's emission spectrum. 
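
The crosstalk correction described above can be illustrated with a minimal sketch. It assumes a linear two-channel mixing model in which a fixed fraction of each dye's emission leaks into the other detector. The leak fractions and the function name are hypothetical and are not taken from the instrument software; in practice the fractions would be derived from each fluorophore's emission spectrum and the filter set.

```python
def correct_crosstalk(measured_cy3, measured_cy5, cy5_into_cy3, cy3_into_cy5):
    """Recover per-dye signals from two detector readings.

    Assumed mixing model (illustrative only):
        measured_cy3 = true_cy3 + cy5_into_cy3 * true_cy5
        measured_cy5 = cy3_into_cy5 * true_cy3 + true_cy5
    """
    # Determinant of the 2x2 mixing matrix [[1, a], [b, 1]]
    det = 1.0 - cy5_into_cy3 * cy3_into_cy5
    if det == 0:
        raise ValueError("channels cannot be separated with these leak fractions")
    true_cy3 = (measured_cy3 - cy5_into_cy3 * measured_cy5) / det
    true_cy5 = (measured_cy5 - cy3_into_cy5 * measured_cy3) / det
    return true_cy3, true_cy5


if __name__ == "__main__":
    # Hypothetical readings and leak fractions
    print(correct_crosstalk(1008.0, 500.0, 0.02, 0.10))
```

With sequential scans, one per fluorophore, both leak fractions are effectively zero and the function returns the measured values unchanged.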

[00506] A grid was superimposed over the fluorescence signal image such that 
the signal from each spot was centered in each element of the grid. The fluorescence 
signal within each element was then integrated to obtain a numerical value 
corresponding to the average intensity of the signal. The software used for signal 
analysis was the GEMTOOLS gene expression analysis program (Incyte). 
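
The grid integration step can be sketched as follows. This is a simplified illustration, not the GEMTOOLS algorithm: it assumes a regular grid of square elements aligned to the image origin, and it represents the image as a list of pixel rows. The function name and the cell size are hypothetical.

```python
def integrate_grid(image, cell_size):
    """Return the mean pixel intensity of each cell_size x cell_size grid element.

    `image` is a list of equal-length rows of pixel intensities. Partial
    cells at the right and bottom edges are ignored.
    """
    rows, cols = len(image), len(image[0])
    result = []
    for top in range(0, rows - cell_size + 1, cell_size):
        row_values = []
        for left in range(0, cols - cell_size + 1, cell_size):
            # Sum every pixel inside the current grid element
            total = sum(
                image[r][c]
                for r in range(top, top + cell_size)
                for c in range(left, left + cell_size)
            )
            row_values.append(total / (cell_size * cell_size))
        result.append(row_values)
    return result


if __name__ == "__main__":
    demo = [
        [1, 1, 2, 2],
        [1, 1, 2, 2],
        [3, 3, 4, 4],
        [3, 3, 4, 4],
    ]
    print(integrate_grid(demo, 2))
```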

EXAMPLE 5 
Microarray Analysis 

[00507] The analysis of hybridized microarrays is a multi-step procedure. 
Arrays were run through an initial quality control procedure that determines spot 
quality. Upon completion, spots passing a set of pre-defined standards were used to 
balance the Cy3 (tumor) and Cy5 (normal pool) signals for each gene using an 
internal method of background correction and signal normalization. The balanced 
signals were then used to calculate the balanced differential expression ratio for each 
transcript represented. The ratio was calculated as: 

Ratio = BalancedCy3 / BalancedCy5, if Cy3 >= Cy5 

Ratio = -(BalancedCy5 / BalancedCy3), if Cy3 < Cy5 

[00508] After normalization, the arrays were passed through another series of 
quality control measures, at the array level, to ensure confidence in the differential 
expression values. For arrays passing all quality control measures, a ratio value of 2 
fold or greater was accepted as a significant differential for both positive and negative 
ratios. The value of 2 fold was used as it has consistently demonstrated 90%-95% 
secondary validation rates. In the context of this experiment, a positive ratio indicates 
an up-regulation of the transcript in the tumor tissue. Transcripts that demonstrated 
up-regulation in at least 30% of the tumors profiled were identified for further 
bioinformatic analysis. The differential expression values were quality controlled 
internally and are shown in Table 3 (up-regulated transcripts) and Table 4 
(down-regulated transcripts). 
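
The ratio calculation and the selection thresholds described in this example can be sketched as below. The sketch assumes that the balanced Cy3 (tumor) and Cy5 (normal pool) signals are already background corrected and normalized, and that a negative sign marks transcripts whose Cy5 signal exceeds the Cy3 signal, which is consistent with the statement that positive ratios indicate up-regulation in tumor tissue. The function names are illustrative.

```python
def balanced_ratio(balanced_cy3, balanced_cy5):
    """Signed fold ratio: positive when tumor (Cy3) >= normal pool (Cy5)."""
    if balanced_cy3 <= 0 or balanced_cy5 <= 0:
        raise ValueError("balanced signals must be positive")
    if balanced_cy3 >= balanced_cy5:
        return balanced_cy3 / balanced_cy5
    # Assumed sign convention for down-regulation in tumor
    return -(balanced_cy5 / balanced_cy3)


def is_significant(ratio, threshold=2.0):
    """A differential of 2 fold or greater in either direction is accepted."""
    return abs(ratio) >= threshold


def frequently_up_regulated(ratios, threshold=2.0, min_fraction=0.30):
    """True when at least 30% of the profiled tumors show up-regulation."""
    up = sum(1 for r in ratios if r >= threshold)
    return up / len(ratios) >= min_fraction


if __name__ == "__main__":
    tumor_ratios = [balanced_ratio(c3, c5) for c3, c5 in [(400, 100), (90, 100), (250, 100)]]
    print(tumor_ratios, frequently_up_regulated(tumor_ratios))
```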

EXAMPLE 6 

SignalP Analysis And TMHMM Analysis 

[0041] Transcripts that were differentially expressed greater than 2 fold 
and in 30% or more of the tumors were further analyzed by SignalP and 
TMHMM analysis to determine if they were membrane associated. 

Signal Sequence and TMHMM Analysis 

[0042] SignalP V2.0 comprises two signal peptide prediction methods, 
SignalP-NN (based on neural networks, corresponding to SignalP V1.1) and 
SignalP-HMM (based on a hidden Markov model). For eukaryotic data, 
SignalP-HMM has been shown to have substantially improved discrimination 
between signal peptides and uncleaved signal anchors, but it has a slightly lower 
accuracy in predicting the precise location of the cleavage site (Nielsen and 
Krogh, In Proceedings of the Sixth International Conference on Intelligent 
Systems for Molecular Biology (1998), AAAI Press, Menlo Park, California, 
pp. 122-130). Since in all proteins examined the predicted site of signal 
cleavage was exactly the same for both programs, only the results of the 
SignalP-HMM program are shown. TMHMM is a membrane protein topology 
prediction program based on an HMM. This program has been shown to correctly 
predict 97-98% of the transmembrane helices (Krogh et al., J Mol Biol, 
305:567-580 (2001)). Additionally, TMHMM can discriminate between soluble and 
membrane proteins with both specificity and sensitivity greater than 99%. 

[0043] Figures 1-31 show the SignalP and TMHMM analysis of the 
protein sequences of PCTUC-5 (SEQ ID NO:39), PCTUC-93 (SEQ ID NO:46), 
PCTUC-190 (SEQ ID NO:57), PCTUC-239 (SEQ ID NO:65), PCTUC-246 
(SEQ ID NO:40), PCTUC-360 (SEQ ID NO:53), PCTUC-462 (SEQ ID 
NO:58), PCTUC-468 (SEQ ID NO:48), PCTUC-536 (SEQ ID NO:3114), 
PCTUC-582 (SEQ ID NO:64), PCTUC-605 (SEQ ID NO:71), PCTUC-629 
(SEQ ID NO:67), PCTUC-722 (SEQ ID NO:61), PCTUC-748 (SEQ ID 
NO:63), PCTUC-784 (SEQ ID NO:72), PCTUC-812 (SEQ ID NO:66), 
PCTUC-856 (SEQ ID NO:49), PCTUC-898 (SEQ ID NO:43), PCTUC-935 
(SEQ ID NO:70), PCTUC-936 (SEQ ID NO:42), PCTUC-986 (SEQ ID 
NO:47), PCTUC-991 (SEQ ID NO:75), PCTUC-992 (SEQ ID NO:60), 
PCTUC-1054 (SEQ ID NO:59), PCTUC-1061 (SEQ ID NO:55), PCTUC-1073 
(SEQ ID NO:56), PCTUC-1075 (SEQ ID NO:73), PCTUC-1078 (SEQ ID 
NO:68), PCTUC-1082 (SEQ ID NO:54), PCTUC-1122 (SEQ ID NO:62), and 
PCTUC-250 (SEQ ID NO:41), respectively. The predictions of the SignalP and 
TMHMM analyses for subcellular localization and membrane topology of these 
peptides are summarized in Table 5. 

TABLE 5 

Gene       Product Location  Cleaved Signal Sequence  Topology                                 Number of TM Segments
PCTUC5     plasma membrane   yes                      N-terminus outside & C-terminus inside   1
PCTUC93    secreted          yes                      N-terminus outside & C-terminus outside  0
PCTUC190   plasma membrane   NA                       N-terminus inside & C-terminus outside   11
PCTUC239   secreted          yes                      N-terminus outside & C-terminus outside  0
PCTUC246   plasma membrane   yes                      N-terminus outside & C-terminus inside   1
PCTUC250   plasma membrane   yes                      N-terminus inside & C-terminus inside    7
PCTUC360   plasma membrane   NA                       N-terminus inside & C-terminus inside    6
PCTUC462   plasma membrane   NA                       N-terminus outside & C-terminus outside  12
PCTUC468   plasma membrane   yes                      N-terminus outside & C-terminus inside   1
PCTUC536   plasma membrane   no                       N-terminus outside & C-terminus outside  4
PCTUC582   secreted          yes                      N-terminus outside & C-terminus outside  0
PCTUC605   plasma membrane   yes                      N-terminus outside & C-terminus inside   1
PCTUC629   plasma membrane   NA                       N-terminus inside & C-terminus inside    12
PCTUC722   secreted          yes                      N-terminus outside & C-terminus outside  0
PCTUC748   plasma membrane   no                       N-terminus inside & C-terminus inside    4
PCTUC784   plasma membrane   yes                      N-terminus outside & C-terminus inside   1
PCTUC812   plasma membrane   yes                      N-terminus outside & C-terminus inside   1
PCTUC856   plasma membrane   no                       N-terminus inside & C-terminus inside    12
PCTUC898   plasma membrane   no                       N-terminus outside & C-terminus inside   10
PCTUC935   plasma membrane   no                       N-terminus inside & C-terminus inside    6
PCTUC936   plasma membrane   yes                      N-terminus outside & C-terminus inside   1
PCTUC986   plasma membrane   yes                      N-terminus outside & C-terminus inside   3
PCTUC991   secreted          yes                      N-terminus outside & C-terminus outside  0
PCTUC992   plasma membrane   NA                       N-terminus outside & C-terminus inside   9
PCTUC1054  plasma membrane   yes                      N-terminus outside & C-terminus inside   1
PCTUC1061  plasma membrane   NA                       N-terminus outside & C-terminus outside  2
PCTUC1073  secreted          yes                      N-terminus outside & C-terminus outside  0
PCTUC1075  plasma membrane   NA                       N-terminus inside & C-terminus inside    12
PCTUC1078  secreted          yes                      N-terminus outside & C-terminus outside  0
PCTUC1082  plasma membrane   yes                      N-terminus inside & C-terminus outside   3
PCTUC1122  plasma membrane   yes                      N-terminus outside & C-terminus inside   1

NA = not applicable (N-terminal signal sequence independent membrane insertion) 
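
The location labels in Table 5 are consistent with a simple rule that combines the two predictions: a product with at least one predicted transmembrane segment is labeled plasma membrane, and a product with a cleaved signal sequence and no transmembrane segment is labeled secreted. The sketch below only reproduces that pattern. It is an assumption inferred from the table rows, not a documented SignalP or TMHMM output, and the function name is hypothetical.

```python
def predict_location(has_cleaved_signal, tm_segments):
    """Combine a signal peptide call with a TM segment count.

    `has_cleaved_signal` may be True, False, or None (the "NA" rows of Table 5).
    """
    if tm_segments > 0:
        # Any predicted transmembrane helix is treated as membrane associated
        return "plasma membrane"
    if has_cleaved_signal:
        # Cleaved signal sequence and no membrane anchor
        return "secreted"
    return "unclassified"


if __name__ == "__main__":
    print(predict_location(True, 0))    # pattern of the PCTUC93 row
    print(predict_location(None, 11))   # pattern of the PCTUC190 row
```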

EXAMPLE 7 

SYBR Green PCR Analysis Of The Time Dependence Of Expression Of Cell Surface 
Associated Genes 

[0044] Where possible, microarray results were confirmed using the SYBR 
green procedure. The SYBR green PCR procedure is a way to perform real-time PCR 
using the SYBR Green I dye. Direct detection of polymerase chain reaction (PCR) 
product is monitored by measuring the increase in fluorescence caused by the binding 
of SYBR green dye to double stranded DNA. Gene specific PCR oligonucleotide 
primer pairs were designed using the Primer Express 1.5 software (Applied 
Biosystems, Foster City, CA). 

[0045] The transcripts whose protein products were determined to be 
associated with the membrane were then further validated by SYBR green (real time 
PCR) analysis on an additional set of either 6 or 13 colon and 6 breast tumor RNAs 
and were compared to a pool of normal colon or breast RNA. First, one microgram of 
each mRNA was added to 100 uL of a reverse transcriptase reaction using the ABI 
Taqman reverse transcription reagents with random hexamers according to the 
manufacturer's protocol (Applied Biosystems, Foster City, CA). The thermal cycling 
conditions included 1 cycle at 25°C for 10 minutes, 1 cycle at 48°C for 30 minutes and 
1 cycle at 95°C for 5 minutes. Four hundred microliters of water was then added to 
the cDNA reaction. The cDNA (10 uL) that was generated was added to a 25 uL 
SYBR green PCR reaction mixture according to the manufacturer's protocol (Applied 
Biosystems, Foster City, CA). The thermal cycling conditions included 1 cycle at 
95°C for ten minutes, followed by 40 cycles of 95°C for 15 s and annealing at 60°C 
for 1 minute. Data are expressed as the fold increase normalized to the same gene 
using the delta delta CT method for relative quantitation. For comparison, data 
obtained with cDNA from colon and breast tumors were compared to data from pools 
of normal colon and breast cDNA. 
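
The delta delta CT calculation can be written out as a short sketch. It assumes the standard comparative CT form, in which each target CT is first referenced to an endogenous control measured in the same sample and amplification efficiency is taken as 100% (a doubling per cycle). The example does not name a control gene, so the reference CT arguments are placeholders, and the function name is hypothetical.

```python
def fold_change_ddct(ct_target_tumor, ct_ref_tumor, ct_target_normal, ct_ref_normal):
    """Relative expression of a target in tumor versus the normal pool.

    Uses fold = 2 ** -(delta delta CT); assumes a doubling per PCR cycle.
    """
    delta_ct_tumor = ct_target_tumor - ct_ref_tumor
    delta_ct_normal = ct_target_normal - ct_ref_normal
    delta_delta_ct = delta_ct_tumor - delta_ct_normal
    return 2.0 ** (-delta_delta_ct)


if __name__ == "__main__":
    # Hypothetical CT values: the target crosses threshold 3 cycles earlier in tumor
    print(fold_change_ddct(24.0, 18.0, 27.0, 18.0))
```

A lower CT in the tumor sample than in the normal pool gives a fold change above 1, which corresponds to up-regulation in the tumor.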

[0046] Table 6 shows the SYBR green primers and probe sets used. Table 7 
shows the SYBR green results in the colon tumors (CT) and breast 
tumors (BT). ND = non-detectable. 

TABLE 6 

SEQ ID NO  Forward Primer               SEQ ID NO  Reverse Primer
3012       CGGAGACAGGCTATGAGTCTGA       3048       TGAAGTCAAACTGCCACATTC
3013       GCCAGCAACCTACATGAACTTG       3049       TGAGAACTGCGGCTGTTCTG
3014       CGACATGCTGGGAGATTACATC       3050       AGAGGCTTTGTCACTCAGCAAGA
3015       TGAGTCTGGGCAGCTGTCC          3051       CTGGACTGCTACCTTTCAAAGCTT
3016       CATCAGGTTGGAGTGCGTCTT        3052       CGAGGCGATGACATAGTTCACA
3017       GGATCAGCCCTGAACTCACT         3053       TTGTCCCTGTCCCTCTCTCT
3018       AGACAAGGATGCCGTGGATAA        3054       TGAAGTCCACCTGGGCATCT
3019       TCAATATAGATGATTGTGCCATCTTCT  3055       CACGTTTATGAGTTGAACTTCTC
3020       TGCAAAGTCTTTGACTCCTTGCT      3056       TCAAGGCACGGGTTGCTT
3021       GTCCAAAGAGTTACTTGCAACAGTCT   3057       ATTAGTAAACATTTTGTCATGCAGCAT
3022       GGCATGGTTTAGGCCCTGTT         3058       TGGCTCTAGGTGTCCACTAAAGG
3023       CCAAGATGCAGAGGTTGATGAA       3059       TCGTCTCAGGCTTCCTGCTT
3024       CCGTTTATGGGTAGACATCTTTGG     3060       TGTTGGAGTATACGTGTGGACATG
3025       GCCATGCCAGCCTTTCTGT          3061       GCAATGAGCTAAGAGCCAACCT
3026       AGCTAGAAGGGCTGGAGAATGC       3062       GAACGTCCTGTTGCGAGTCTT
3027       GGTACAAATTATTTGGCTCGACTTC    3063       CACTCTGGCAACGGGTCACT
3028       TGATGCAATCACACGGGAACT        3064       GAGGTCACAGCCGACTTTAAACC
3029       CATGGCATGGTTAGAAGCTCTATCT    3065       ACACTCTGATGATTCCCACGAACTA
3030       CGTTCTCTCCATTGCTTGTTAGC      3066       CACAGGACAGGGATGGAGAAG

PCTUC  SEQ ID NO  Forward Primer               SEQ ID NO  Reverse Primer
992    3031       TCAAGGGAGCCAAGAGCTCTT        3067       TTGACAGTGTGTTTATGTGGAATGTT
722    3032       GACAGCAAGGTGCCCTCAGT         3068       GTAGGCGCACACCTTCATCTC
1122   3033       TGTCTGCGAAGAAGGCTAGGAG       3069       ATGGACTGAAGCTGTTGTTGCC
748    3034       TCAAGATCCGTGCTCGCAGT         3070       GGGATACAGGGTTTCAACGA
582    3035       GTTCAGCGTACATCCGGAGACT       3071       TGACCATTTACCCACCACAGGT
239    3036       TTGTCATCCGTCTTCTGAC          3072       GTGGGCACCTTTGATTCCT
812    3037       CGTAAGCAGTATGGCTCCAA         3073       AGCACCTCCTGCTTGCTTAT
629    3038       ACCCAAACTCCACAAAGCCATT       3074       GCCAGGATGAACACGTACATGTA
1078   3039       CCCTTCCAAGTAAGTCCAACGA       3075       TGTCAGGTCTGCGAAACTTCfT
1124   3040       GACAGTCACAGCAGCCTTGACA       3076       TGAACGGCGTGGATTCAATA
935    3041       TGCAGATCCTGAGGATGCTAC        3077       TCCTTCTCAGCCAGGTACACAA
605    3042       GTGGAGGACAGAAAGCCAAGTG       3078       GCACCATTTCCTGAGACTTGCT
784    3043       TGGCTCTCGGTTTCTCTGCTT        3079       CGCGGAAGACGCTGTTATT
1075   3044       TGGCTTGATCAAGGGCCTTA         3080       TGGTCACGTTTCGGTTTCAT
1125   3045       AGAAGAGCTGCCAGGAAGTGTT       3081       TCCACATGACCAGACTCTCCA
991    3046       TGGAAAACAGCAAACCACCTT        3082       CAGAGCAGATGCCAAGCCTAA
1126   3047       TTCTGAGGCATTAAGCCAGCA        3083       TGCATGGAGTTGCTGCTGT



TABLE 7 

[Table 7: SYBR green results in the colon tumors and breast tumors. The tabulated values are not recoverable from this copy.] 


EXAMPLE 8 

Gene Expression In E. coli 

[0041] This example illustrates preparation of an unglycosylated form of 
polypeptide by recombinant expression in E. coli. 

[0042] The DNA sequence encoding the polypeptide is initially amplified using 
selected PCR primers. The primers should contain restriction enzyme sites that 
correspond to the restriction enzyme sites on the selected expression vector. A variety 
of expression vectors can be employed. An example of a suitable vector is pBR322 
(derived from E. coli; see Bolivar et al., Gene, 2:95 (1977)), which contains genes for 
ampicillin and tetracycline resistance. The vector is digested with restriction enzyme 
and dephosphorylated. The PCR amplified sequences are then ligated into the vector. 
The vector will preferably include sequences that encode an antibiotic 
resistance gene, a trp promoter, a polyhis leader (including the first six STII codons, 
polyhis sequence, and enterokinase cleavage site), the region encoding a protein of the 
present invention, lambda transcriptional terminator, and an argU gene. 

[0043] The ligation mixture is then used to transform a selected E. coli strain 
using the methods described in Sambrook et al., supra. Transformants are identified 
by their ability to grow on LB plates and antibiotic resistant colonies are then selected. 
Plasmid DNA can be isolated and confirmed by restriction analysis and DNA 
sequencing. 

[0044] Selected clones can be grown overnight in liquid culture medium such 
as LB broth supplemented with antibiotics. The overnight culture can subsequently be 
used to inoculate a larger scale culture. The cells are then grown to a desired optical 
density, during which the expression promoter is turned on. 

[0045] After culturing the cells for several more hours, the cells can be 
harvested by centrifugation. The cell pellet obtained by the centrifugation can be 
solubilized using various agents known in the art, and the solubilized protein can then 
be purified using a metal chelating column under conditions that allow tight binding 
of the protein. 

[0046] Proteins of the present invention can be expressed in E. coli in a 
poly-His tagged form, using the following procedure. The DNA encoding a protein of 
the present invention is initially amplified using selected PCR primers. The primers will 
contain restriction enzyme sites which correspond to the restriction enzyme sites on 
the selected expression vector, and other useful sequences providing for efficient and 
reliable translation initiation, rapid purification on a metal chelation column, and 
proteolytic removal with enterokinase. The PCR-amplified, poly-His tagged 
sequences are then ligated into an expression vector, which is used to transform an 
E. coli host based on strain 52 (W3110 fuhA(tonA) lon galE rpoHts(htpRts) clpP(lacIq)). 
Transformants are first grown in LB containing 50 mg/ml carbenicillin at 30°C with 
shaking until an O.D.600 of 3-5 is reached. Cultures are then diluted 50-100 fold into 
CRAP media (prepared by mixing 3.57 g (NH4)2SO4, 0.71 g sodium citrate-2H2O, 1.07 
g KCl, 5.36 g Difco yeast extract, 5.36 g Sheffield hycase SF in 500 mL water, as 
well as 110 mM MPOS, pH 7.3, 0.55% (w/v) glucose and 7 mM MgSO4) and grown 
for approximately 20-30 hours at 30°C with shaking. Samples are removed to verify 
expression by SDS-PAGE analysis, and the bulk culture is centrifuged to pellet the 
cells. Cell pellets are frozen until purification and refolding. 

[0047] E. coli paste from 0.5 to 1 L fermentations (6-10 g pellets) is 
resuspended in 10 volumes (w/v) in 7 M guanidine, 20 mM Tris, pH 8 buffer. Solid 
sodium sulfite and sodium tetrathionate are added to make final concentrations of 0.1 
M and 0.02 M, respectively, and the solution is stirred overnight at 4°C. This step 
results in a denatured protein with all cysteine residues blocked by sulfitolization. The 
solution is centrifuged at 40,000 rpm in a Beckman Ultracentrifuge for 30 min. The 
supernatant is diluted with 3-5 volumes of metal chelate column buffer (6 M 
guanidine, 20 mM Tris, pH 7.4) and filtered through 0.22 micron filters to clarify. The 
clarified extract is loaded onto a 5 ml Qiagen Ni-NTA metal chelate column 
equilibrated in the metal chelate column buffer. The column is washed with additional 
buffer containing 50 mM imidazole (Calbiochem, Utrol grade), pH 7.4. The protein is 
eluted with buffer containing 250 mM imidazole. Fractions containing the desired 
protein are pooled and stored at 4°C. Protein concentration is estimated by its 
absorbance at 280 nm using the calculated extinction coefficient based on its amino 
acid sequence. 
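
The concentration estimate from absorbance at 280 nm can be sketched as below. The example does not say how the extinction coefficient was calculated. The sketch assumes the widely used residue-count approximation (5500 per tryptophan, 1490 per tyrosine and 125 per disulfide bond, in M^-1 cm^-1) and the Beer-Lambert law with a 1 cm path length. The function names and the demonstration sequence are hypothetical.

```python
def extinction_coefficient(sequence, disulfide_bonds=0):
    """Approximate molar extinction coefficient at 280 nm (M^-1 cm^-1).

    Assumed residue contributions: Trp 5500, Tyr 1490, cystine 125.
    For sulfitolyzed protein no disulfide bonds remain, so the default is 0.
    """
    seq = sequence.upper()
    return 5500 * seq.count("W") + 1490 * seq.count("Y") + 125 * disulfide_bonds


def concentration_molar(a280, epsilon, path_cm=1.0):
    """Beer-Lambert law: c = A / (epsilon * l)."""
    if epsilon <= 0:
        raise ValueError("sequence has no Trp, Tyr or cystine; A280 is not usable")
    return a280 / (epsilon * path_cm)


if __name__ == "__main__":
    eps = extinction_coefficient("MWYYC")  # hypothetical short sequence
    print(eps, concentration_molar(0.848, eps))
```

Multiplying the molar concentration by the molecular weight of the protein gives the concentration in g/L.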

[0048] The proteins are refolded by diluting the sample slowly into freshly 
prepared refolding buffer consisting of 20 mM Tris, pH 8.6, 0.3 M NaCl, 2.5 M urea, 
5 mM cysteine, 20 mM glycine and 1 mM EDTA. Refolding volumes are chosen so 
that the final protein concentration is between 50 and 100 micrograms/ml. The 
refolding solution is stirred gently at 4°C for 12-36 hours. The refolding reaction is 
quenched by the addition of TFA to a final concentration of 0.4% (pH of 
approximately 3). Before further purification of the protein, the solution is filtered 
through a 0.22 micron filter and acetonitrile is added to 2-10% final concentration. 
The refolded protein is chromatographed on a Poros R1/H reversed phase column 
using a mobile buffer of 0.1% TFA with elution with a gradient of acetonitrile from 
10 to 80%. Aliquots of fractions with A280 absorbance are analyzed on SDS 
polyacrylamide gels and fractions containing homogeneous refolded protein are 
pooled. Generally, the properly refolded species of most proteins are eluted at the 
lowest concentrations of acetonitrile since those species are the most compact with 
their hydrophobic interiors shielded from interaction with the reversed phase resin. 
Aggregated species are usually eluted at higher acetonitrile concentrations. In addition 
to resolving misfolded forms of proteins from the desired form, the reversed phase 
step also removes endotoxin from the samples. 
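
Choosing the refolding volume is a single division, shown here as a small sketch. It assumes the amount of pooled protein is known in milligrams from the A280 estimate. The function name is hypothetical, and the range check simply enforces the 50 to 100 micrograms/ml window stated above.

```python
def refolding_volume_ml(protein_mg, target_ug_per_ml):
    """Final refolding volume (ml) that gives the target protein concentration."""
    if not 50 <= target_ug_per_ml <= 100:
        raise ValueError("target concentration should be 50 to 100 micrograms/ml")
    # Convert mg to micrograms, then divide by the target concentration
    return protein_mg * 1000.0 / target_ug_per_ml


if __name__ == "__main__":
    # Hypothetical 10 mg of pooled protein
    print(refolding_volume_ml(10, 50), refolding_volume_ml(10, 100))
```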

[0049] Fractions containing the desired folded polypeptide are pooled and the 
acetonitrile removed using a gentle stream of nitrogen directed at the solution. 
Proteins are formulated into 20 mM Hepes, pH 6.8 with 0.14 M sodium chloride and 
4% mannitol by dialysis or by gel filtration using G25 Superfine (Pharmacia) resins 
equilibrated in the formulation buffer and sterile filtered. 

EXAMPLE 9 

Expression Of Polypeptides In Mammalian Cells 

[0050] This example illustrates preparation of a potentially glycosylated form 
of polypeptide by recombinant expression in mammalian cells. 

[0051] The vector, pRK5 (see EP 307,247, published March 15, 1989), is 
employed as the expression vector. Optionally, the DNA is ligated into pRK5 with 
selected restriction enzymes to allow insertion of the DNA using ligation methods 
such as described in Sambrook et al., supra, and the resulting vector is designated 
pRK5-DNA. 

[0052] In one embodiment, the selected host cells can be NIH3T3 cells, using 
the vectors and transfection methods described herein for other mammalian cells. 
Transfected NIH3T3 cells, over-expressing a gene encoding a protein of the present 
invention or expressing antisense, are tested for activity. 

[0053] In one embodiment, the selected host cells can be 293 cells. Human 
293 cells (ATCC CCL 1573) are grown to confluence in tissue culture plates in 
medium such as DMEM supplemented with fetal calf serum and optionally, nutrient 
components and/or antibiotics. About 10 µg pRK5-DNA is mixed with about 1 µg 
DNA encoding the VA RNA gene (Thimmappaya et al., Cell, 31:543 (1982)) and 
dissolved in 500 µl of 1 mM Tris-HCl, 0.1 mM EDTA, 0.227 M CaCl2. To this 
mixture is added, dropwise, 500 µl of 50 mM HEPES (pH 7.35), 280 mM NaCl, 1.5 
mM NaPO4, and a precipitate is allowed to form for 10 minutes at 25°C. The 
precipitate is suspended and added to the 293 cells and allowed to settle for about four 
hours at 37°C. The culture medium is aspirated off and 2 ml of 20% glycerol in PBS is 
added for 30 seconds. The 293 cells are then washed with serum free medium, fresh 
medium is added, and the cells are incubated for about 5 days. 

[0054] Approximately 24 hours after transfection, the culture medium is 
removed and replaced with culture medium (alone) or culture medium containing 
200 µCi/ml 35S-cysteine and 200 µCi/ml 35S-methionine. After a 12-hour incubation, 
the conditioned medium is collected, concentrated on a spin filter, and loaded onto a 
15% SDS gel. The processed gel can be dried and exposed to film for a selected 
period of time to reveal the presence of polypeptide. The cultures containing 
transfected cells can undergo further incubation (in serum free medium) and the 
medium is then tested in selected bioassays. 

[0055] In an alternative technique, a protein of the present invention can be 
introduced into 293 cells transiently using the dextran sulfate method described by 
Somparyrac et al., Proc. Natl. Acad. Sci., 12:7575 (1981). 293 cells are grown to 
maximal density in a spinner flask and 700 µg pRK5-DNA is added. The cells are first 
concentrated from the spinner flask by centrifugation and washed with PBS. The 
DNA-dextran precipitate is incubated on the cell pellet for four hours. The cells are 
treated with 20% glycerol for 90 seconds, washed with tissue culture medium, and 
reintroduced into the spinner flask containing tissue culture medium, 5 µg/ml bovine 
insulin, and 0.1 µg/ml bovine transferrin. After about four days, the conditioned media 
is centrifuged and filtered to remove cells and debris. The sample containing the 
expressed protein of the present invention can then be concentrated and purified by 
any selected method, such as dialysis and/or column chromatography. 

[0056] In another embodiment, a protein of the present invention can be 
expressed in CHO cells. The pRK5-DNA can be transfected into CHO cells using 
known reagents such as CaPO4 or DEAE-dextran. As described above, the cell 
cultures can be incubated, and the medium replaced with culture medium (alone) or 
medium containing a radiolabel such as 35S-methionine. After determining the 
presence of polypeptide, the culture medium can be replaced with serum free medium. 
Preferably, the cultures are incubated for about 6 days, and then the conditioned 
medium is harvested. The medium containing the expressed protein of the present 
invention can then be concentrated and purified by any selected method. 

[0057] Epitope-tagged polypeptide can also be expressed in host CHO cells. 
The DNA can be subcloned out of the pRK5 vector. The subclone insert can undergo 
PCR to fuse in frame with a selected epitope tag such as a poly-His tag or myc tag 
such as in pcDNA3.1/myc-his (Invitrogen, Carlsbad, CA). This vector has the neo 
gene for selection of stable clones using G418 and the human cytomegalovirus 
promoter operationally linked to the genes of the present invention. Labeling can be 
performed, as described above, to verify expression. The culture medium containing 
the expressed poly-His tagged polypeptide can then be concentrated and purified by 
any selected method, such as by Ni2+-chelate affinity chromatography. 

[0058] Proteins of the present invention can also be expressed in CHO and/or 
COS cells by a transient expression procedure or in CHO cells by another stable 
expression procedure. 

[0059] Stable expression in CHO cells is performed using the following 
procedure. The proteins are expressed as an IgG construct (immunoadhesin), in which 
the coding sequences for the soluble forms (e.g., extracellular domains) of the 
respective proteins are fused to an IgG1 constant region sequence containing the 
hinge, CH2 and CH3 domains, and/or as a poly-His tagged form. 

[0060] Following PCR amplification, the respective DNAs are subcloned in a 
CHO expression vector using standard techniques as described in Ausubel et al., 
Current Protocols of Molecular Biology, Unit 3.16, John Wiley and Sons (1997). 
CHO expression vectors are constructed to have compatible restriction sites 5' and 3' 
of the DNA of interest to allow the convenient shuttling of cDNAs. The vector used 
for expression in CHO cells is as described in Lucas et al., Nucl. Acids Res. 24:9 
(1774-1779) (1996), and uses the SV40 early promoter/enhancer to drive expression of 
the cDNA of interest and dihydrofolate reductase (DHFR). DHFR expression permits 
selection for stable maintenance of the plasmid following transfection. 

[0061] Twelve micrograms of the desired plasmid DNA is introduced into 
approximately 10 million CHO cells using commercially available transfection 
reagents Superfect® (Qiagen), Dosper® or Fugene® (Boehringer Mannheim). The 
cells are grown as described in Lucas et al., supra. Approximately 3 x 10^7 cells are 
frozen in an ampule for further growth and production as described below. 

[0062] The ampules containing the plasmid DNA are thawed by placement 
into a water bath and mixed by vortexing. The contents are pipetted into a centrifuge 
tube containing 10 mL of media and centrifuged at 1000 rpm for 5 minutes. The 
supernatant is aspirated and the cells are resuspended in 10 mL of selective media (0.2 
µm filtered PS20 with 5% 0.2 µm diafiltered fetal bovine serum). The cells are then 
aliquoted into a 100 mL spinner containing 90 mL of selective media. After 1-2 days, 
the cells are transferred into a 250 mL spinner filled with 150 mL selective growth 
medium and incubated at 37°C. After another 2-3 days, 250 mL, 500 mL and 2000 
mL spinners are seeded with 3 x 10^5 cells/mL. The cell media is exchanged with fresh 
media by centrifugation and resuspension in production medium. Although any 
suitable CHO media can be employed, a production medium described in U.S. Patent 
No. 5,122,469, issued June 16, 1992, can actually be used. A 3L production spinner is 
seeded at 1.2 x 10^6 cells/mL. On day 0, the cell number and pH are determined. On 
day 1, the spinner is sampled and sparging with filtered air is commenced. On day 2, 
the spinner is sampled, the temperature is shifted to 33°C, and 30 mL of 500 g/L 
glucose and 0.6 mL of 10% antifoam (e.g., 35% polydimethylsiloxane emulsion, Dow 
Corning 365 Medical Grade Emulsion) are added. 
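
The seeding arithmetic for the production spinner can be sketched as follows. The target density (1.2 x 10^6 cells/mL) and the 3 L volume come from the procedure above. The density of the seed culture is a hypothetical input, as are the function names.

```python
def cells_required(target_density_per_ml, culture_volume_ml):
    """Total number of cells needed to seed a culture at the target density."""
    return target_density_per_ml * culture_volume_ml


def inoculum_volume_ml(target_density_per_ml, culture_volume_ml, stock_density_per_ml):
    """Volume of seed culture that supplies the required number of cells."""
    if stock_density_per_ml < target_density_per_ml:
        raise ValueError("seed culture is too dilute; concentrate the cells first")
    return cells_required(target_density_per_ml, culture_volume_ml) / stock_density_per_ml


if __name__ == "__main__":
    # 3 L spinner at 1.2e6 cells/mL, seeded from a hypothetical 2.4e6 cells/mL culture
    print(cells_required(1.2e6, 3000), inoculum_volume_ml(1.2e6, 3000, 2.4e6))
```

Because the procedure exchanges the media by centrifugation and resuspension, the computed cell number is the more directly useful figure; the inoculum volume only indicates how much seed culture must be spun down.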

[0063] Throughout the production, the pH is adjusted as necessary to keep it at 
around 7.2. After 10 days, or when the viability drops below 70%, the cell culture is 
harvested by centrifugation and filtering through a 0.22 µm filter. The filtrate is 
either stored at 4°C or immediately loaded onto columns for purification. For the 
poly-His tagged constructs, the proteins are purified using a Ni-NTA column (Qiagen). 
Before purification, imidazole is added to the conditioned media to a concentration of 
5 mM. The conditioned media is pumped onto a 6 ml Ni-NTA column equilibrated in 
20 mM Hepes, pH 7.4, buffer containing 0.3 M NaCl and 5 mM imidazole at a flow 
rate of 4-5 ml/min at 4°C. After loading, the column is washed with additional 
equilibration buffer and the protein eluted with equilibration buffer containing 0.25 M 
imidazole. The highly purified protein is subsequently desalted into a storage buffer 
containing 10 mM Hepes, 0.14 M NaCl and 4% mannitol, pH 6.8, with a 25 ml G25 
Superfine (Pharmacia) column and stored at -80°C. 

[0064] Immunoadhesin (Fc-containing) constructs are purified from the 
conditioned media as follows. The conditioned medium is pumped onto a 5 ml Protein 
A column (Pharmacia) that has been equilibrated in 20 mM Na phosphate buffer, pH 
6.8. After loading, the column is washed extensively with equilibration buffer before 
elution with 100 mM citric acid, pH 3.5. The eluted protein is immediately neutralized 
by collecting one ml fractions into tubes containing 275 µL of 1 M Tris buffer, pH 9. 
The highly purified protein is subsequently desalted into storage buffer as described 
above for the poly-His tagged proteins. The homogeneity is assessed by SDS 
polyacrylamide gels and by N-terminal amino acid sequencing by Edman degradation. 
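The composition of each neutralized fraction follows from simple dilution, as the sketch below illustrates. It computes only the nominal citrate and Tris concentrations after mixing; it does not predict the resulting pH, which would require a buffer equilibrium calculation. The function name is hypothetical.

```python
# Nominal composition of a neutralized Protein A elution fraction.
# Volumes and concentrations are taken from the text.

def mix_mm(conc_a_mm: float, vol_a_ul: float, conc_b_mm: float, vol_b_ul: float):
    """Final concentrations (mM) of two solutes after combining two solutions."""
    total_ul = vol_a_ul + vol_b_ul
    return conc_a_mm * vol_a_ul / total_ul, conc_b_mm * vol_b_ul / total_ul


# 1 ml of 100 mM citric acid eluate collected into 275 µL of 1 M Tris.
citrate_mm, tris_mm = mix_mm(100.0, 1000.0, 1000.0, 275.0)

print(f"citrate: {citrate_mm:.1f} mM, Tris: {tris_mm:.1f} mM")
```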

[0065] Stable expression in CHO cells was also accomplished without fusing 
the gene of interest to the Fc domain. In this case, a plasmid that expresses the gene 
of interest, such as pcDNA3.1/myc-his, described above, as a c-myc, poly-His fusion, 
was transfected into approximately 5x10^ cells in a 35 mm tissue culture dish. The 
following day, the cells were dislodged by trypsin/EDTA or other comparable cell 
dissociation mixture, diluted into 100 ml growth media and plated into ten 100 mm 
tissue culture plates. The following day, the media was removed and replaced with 
media containing the appropriate amount of selection drug. In this case, the selection 
drug is G418 (Geneticin, Life Technologies) added to approximately 800 µg/ml. 
After approximately 10 days, the cells that did not stably take up the DNA are dead 
and small colonies of stably transfected cells are visible. These cells can be 
dissociated either singly, using cloning cylinders (Freshney, 2000), or as a pool and 
plated in larger tissue culture plates. Expression of the gene of interest can be 
detected in the individual clones, after expansion, by the techniques detailed above or 
by other techniques such as Western blotting using antibodies to the protein of interest 
or to the epitope tags. 

[0066] In addition, the gene of interest was cloned into the pIRES2-eGFP vector 
(Clontech). This vector expresses an enhanced green fluorescent protein (eGFP) 
placed downstream from the internal ribosomal entry site (IRES). Cells are 
transfected with this vector and selected using G418. Cells that take up the DNA can 
be identified by fluorescence microscopy as green fluorescent cells. 
Moreover, cells that express the gene of interest can be isolated or enriched by 
fluorescence activated cell sorting. Expression of the gene of interest by the 
GFP-expressing cells was accomplished as above. 

[0067] In addition to CHO, HEK293 and HUVEC cells, the genes of interest 
are expressed in various tumor cell lines using the same vectors and detection 
techniques described above. Some of the human tumor cell lines used include SW480 
(ATCC, CCL-228), HCT-116 (ATCC, CCL-247), DLD-1 (ATCC, CCL-221), LS 174T 
(ATCC, CCL-188) and HT-29 (ATCC, HTB-38). 

EXAMPLE 10 

Gene Expression In Yeast 

[0068] Recombinant expression of polypeptide in yeast can also be 
accomplished. 

[0069] First, yeast expression vectors are constructed for intracellular 
production or secretion of a protein of the present invention from the ADH2/GAPDH 
promoter. DNA encoding a protein of the present invention and the promoter is 
inserted into suitable restriction enzyme sites in the selected plasmid to direct 
intracellular expression of a protein of the present invention. For secretion, DNA 
encoding a protein of the present invention can be cloned into the selected plasmid, 
together with DNA encoding the ADH2/GAPDH promoter, a native signal peptide of 
a protein of the present invention or other mammalian signal peptide, or, for example, 
a yeast alpha-factor or invertase secretory signal/leader sequence, and linker 
sequences (if needed) for expression of a protein of the present invention. 

[0070] Yeast cells, such as yeast strain AB110, can then be transformed with 
the expression plasmids described above and cultured in selected fermentation media. 
The transformed yeast supernatants can be analyzed by precipitation with 10% 
trichloroacetic acid and separation by SDS-PAGE, followed by staining of the gels 
with Coomassie Blue stain. 

[0071] Recombinant polypeptide can subsequently be isolated and purified by 
removing the yeast cells from the fermentation medium by centrifugation and then 
concentrating the medium using selected cartridge filters. The concentrate containing 
polypeptide can further be purified using selected column chromatography resins. 

EXAMPLE 11 

Expression Of Polypeptides In Baculovirus-Infected Cells 

[0072] The following method describes recombinant expression of 
polypeptide in Baculovirus-infected insect cells. 

[0073] The DNA sequence coding for polypeptide is fused upstream of an 
epitope tag contained within a baculovirus expression vector. Such epitope tags 
include poly-His tags and immunoglobulin tags (like Fc regions of IgG). A variety of 
plasmids can be employed, including plasmids derived from commercially available 
plasmids such as pVL1393 (Novagen). Briefly, the sequence encoding polypeptide or 
the desired portion of the coding sequence, such as the sequence encoding the 
extracellular domain of a transmembrane protein or the sequence encoding the mature 
protein if the protein is extracellular, is amplified by PCR with primers complementary 
to the 5' and 3' regions. The 5' primer can incorporate flanking (selected) restriction 
enzyme sites. The product is then digested with those selected restriction enzymes and 
subcloned into the expression vector. 

[0074] Recombinant baculovirus is generated by co-transfecting the above 
plasmid and BaculoGold™ virus DNA (Pharmingen) into Spodoptera frugiperda 
("Sf9") cells (ATCC CRL 1711) using lipofectin (commercially available from 
GIBCO-BRL). After 4-5 days of incubation at 28°C, the released viruses are 
harvested and used for further amplifications. Viral infection and protein expression 
are performed as described by O'Reilley et al., Baculovirus expression vectors: A 
Laboratory Manual, Oxford: Oxford University Press (1994). 

[0075] Expressed poly-His tagged polypeptide can then be purified, for 
example, by Ni²⁺-chelate affinity chromatography as follows. Extracts are prepared 
from recombinant virus-infected Sf9 cells as described by Rupert et al., Nature, 
362:175-179 (1993). Briefly, Sf9 cells are washed, resuspended in sonication buffer 
(25 mM Hepes, pH 7.9; 12.5 mM MgCl2; 0.1 mM EDTA; 10% glycerol; 0.1% NP-40; 
0.4 M KCl), and sonicated twice for 20 seconds on ice. The sonicates are cleared by 
centrifugation, and the supernatant is diluted 50-fold in loading buffer (50 mM 
phosphate, 300 mM NaCl, 10% glycerol, pH 7.8) and filtered through a 0.45 µm filter. 
A Ni²⁺-NTA agarose column (commercially available from Qiagen) is prepared with a 
bed volume of 5 mL, washed with 25 mL of water, and equilibrated with 25 mL of 
loading buffer. The filtered cell extract is loaded onto the column at 0.5 mL per 
minute. The column is washed to baseline A280 with loading buffer, at which point 
fraction collection is started. Next, the column is washed with a secondary wash 
buffer (50 mM phosphate, 300 mM NaCl, 10% glycerol, pH 6.0), which elutes 
nonspecifically bound protein. After reaching A280 baseline again, the column is 
developed with a 0 to 500 mM imidazole gradient in the secondary wash buffer. One 
mL fractions are collected and analyzed by SDS-PAGE and silver staining or Western 
blot with Ni²⁺-NTA conjugated to alkaline phosphatase (Qiagen). Fractions 
containing the eluted His10-tagged protein are pooled and dialyzed against loading 
buffer. 
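When fractions from the imidazole gradient are analyzed, it can help to know the nominal imidazole concentration at which each one eluted. The sketch below assumes a linear gradient and a total gradient volume of 50 mL (ten bed volumes); the text specifies only the 0 to 500 mM range and the 1 mL fraction size, so the gradient shape, its volume, and the function name are illustrative assumptions.

```python
# Nominal imidazole concentration per fraction for a linear gradient.
# From the text: 0 to 500 mM imidazole, 1 mL fractions.
# Assumed (not from the text): linear shape, 50 mL total gradient volume.

def gradient_concentration_mm(
    fraction_index: int,
    fraction_ml: float = 1.0,
    gradient_ml: float = 50.0,
    start_mm: float = 0.0,
    end_mm: float = 500.0,
) -> float:
    """Concentration at the midpoint of a fraction (index starts at 0)."""
    midpoint_ml = (fraction_index + 0.5) * fraction_ml
    if not 0.0 <= midpoint_ml <= gradient_ml:
        raise ValueError("fraction lies outside the gradient")
    return start_mm + (end_mm - start_mm) * midpoint_ml / gradient_ml


# Print every tenth fraction as a quick overview of the gradient.
for i in range(0, 50, 10):
    print(f"fraction {i + 1}: {gradient_concentration_mm(i):.0f} mM imidazole")
```

Actual elution positions depend on the dead volume between the column outlet and the fraction collector, which this sketch ignores.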

[0076] Alternatively, purification of an IgG-tagged (or Fc-tagged) protein of 
the present invention can be performed using known chromatography techniques, 
including, for instance, Protein A or protein G column chromatography. 

EXAMPLE 12 

Preparation Of Monoclonal Antibodies That Bind A Polypeptide Of The Present 
Invention 

[0077] This example illustrates preparation of monoclonal antibodies, which 
can specifically bind a protein of the present invention. 

[0078] Techniques for producing the monoclonal antibodies are known in the 
art and are described, for instance, in Goding, supra. Immunogens that can be 
employed include a purified protein of the present invention, fusion proteins 
containing a protein of the present invention, and cells expressing a recombinant 
protein of the present invention on the cell surface. Selection of the immunogen can 
be made by the skilled artisan without undue experimentation. Mice, such as Balb/c, 
are immunized with the immunogen emulsified in complete Freund's adjuvant and 
injected subcutaneously or intraperitoneally in an amount from 1-100 micrograms. 
Alternatively, the immunogen is emulsified in MPL-TDM adjuvant (Ribi 
Immunochemical Research, Hamilton, MT) and injected into the animal's hind 
footpads. The immunized mice are then boosted 10 to 12 days later with additional 
immunogen emulsified in the selected adjuvant. Thereafter, for several weeks, the 
mice can also be boosted with additional immunization injections. Serum samples can 
be periodically obtained from the mice by retro-orbital bleeding for testing in ELISA 
assays to detect antibodies. 

[0079] After a suitable antibody titer has been detected, the animals "positive" 
for antibodies can be injected with a final intravenous injection of polypeptide. Three 
to four days later, the mice are sacrificed and the spleen cells are harvested. The 
spleen cells are then fused (using 35% polyethylene glycol) to a selected murine 
myeloma cell line such as P3X63AgU.1, available from ATCC, No. CRL 1597. The 
fusions generate hybridoma cells, which can then be plated in 96 well tissue culture 
plates containing HAT (hypoxanthine, aminopterin, and thymidine) medium to inhibit 
proliferation of non-fused cells, myeloma hybrids, and spleen cell hybrids. 

[0080] The hybridoma cells will be screened in an ELISA for reactivity 
against a protein of the present invention. Determination of "positive" hybridoma cells 
secreting the desired monoclonal antibodies against polypeptide is within the skill in 
the art. 

[0081] The positive hybridoma cells can be injected intraperitoneally into 
syngeneic Balb/c mice to produce ascites containing the monoclonal antibodies 
against a protein of the present invention. Alternatively, the hybridoma cells can be 
grown in tissue culture flasks or roller bottles. Purification of the monoclonal 
antibodies produced in the ascites can be accomplished using ammonium sulfate 
precipitation, followed by gel exclusion chromatography. Alternatively, affinity 
chromatography based upon binding of antibody to protein A or protein G can be 
employed. 

EXAMPLE 13 

Preparation Of Antibodies That Bind A Polypeptide Of The Present Invention 

[0082] This example illustrates preparation of polyclonal antibodies, which 
can specifically bind a protein of the present invention. 

[0083] Techniques for producing the polyclonal antibodies are known in the 
art and are described, for instance, in Harlow and Lane, Antibodies: A Laboratory 
Manual, Cold Spring Harbor Laboratory, (1988). Immunogens that can be employed 
include a purified protein of the present invention, fusion proteins containing a protein 
of the present invention, cells expressing a recombinant protein of the present 
invention on the cell surface, and synthetic peptides derived from a protein of the present 
invention coupled to a carrier protein. Selection of the immunogen can be made by the 
skilled artisan without undue experimentation. Tables 8-14 show exemplary 
peptides suitable for polyclonal antibody production. Rabbits, such as New Zealand 
White, are immunized with the immunogen emulsified in complete Freund's adjuvant 
and injected subcutaneously or intramuscularly in an amount from 50-1000 micrograms. 
The immunized rabbits are then boosted 2 to 4 weeks later with additional immunogen 
emulsified in the selected adjuvant. Thereafter, for several months, the rabbits can also 
be boosted with additional immunization injections. Serum samples can be 
periodically obtained from the ear vein of rabbits for testing in ELISA assays to detect 
antibodies. 

[0084] Purification of the polyclonal antibodies from serum can be 
accomplished using ammonium sulfate precipitation, followed by gel exclusion 
chromatography. Alternatively, affinity chromatography based upon binding of 
antibody to protein A, protein G or the immunogen can be employed. 

Table 8 

KIAA 0792 peptides 

CKIEQALAQTGSVAAAPQEALSN (SEQ ID NO:3084) 
CKIELPRDARKETVESHFRDLSN (SEQ ID NO:3085) 
CKIEDLLDKDPYSFGRTTIALSN (SEQ ID NO:3086) 

Table 9 

GPR-56 peptides 

CKIERGHREDFRFASQRNQTLSN (SEQ ID NO:3087) 
CKIEHAPFPAAHPASRSFPDLSN (SEQ ID NO:3088) 
CKIERLQARGGPSPLKSNSDLSN (SEQ ID NO:3089) 

Table 10 

P cadherin peptides 

VTDQNDHKPKFTQ (SEQ ID NO:3090) 
DANDNAPMFDPQKY (SEQ ID NO:3091) 
DVNEAPVFVPPSK (SEQ ID NO:3092) 
DVNDHGPVPEPRQI (SEQ ID NO:3093) 
RDWVVAPISVPE (SEQ ID NO:3094) 
YTLTIQATDMDGDGSTTTAV (SEQ ID NO:3095) 

Table 11 

FGFR3 peptides 

VENKFGSIRQTYTLD (SEQ ID NO:3096) 
GLPANQTAVLGSDVE (SEQ ID NO:3097) 
GLPANQTAILGSDVE (SEQ ID NO:3098) 
PYVTVLKTAGANTTDK (SEQ ID NO:3099) 
PYVTVLKSWISESVEAD (SEQ ID NO:3100) 
PYVTVLKSWISEVEAD (SEQ ID NO:3101) 

Table 12 

CCK4 peptides 

KQPSSQDALQGRRALLR (SEQ ID NO:3102) 
PAGSIEAQAVLQVLEKLK (SEQ ID NO:3103) 
KSLQSKDEQQQLDFRRE (SEQ ID NO:3104) 

Table 13 

TM4SF6 peptides 

EIKNSFKNNYEKALKQYN (SEQ ID NO:3105) 
DYRDWTDTNYYSEKGFPK (SEQ ID NO:3106) 
MASPSRRLQTKPVrr (SEQ ID NO:3107) 

Table 14 

IFITM peptides 

MNHIVQTFSPVNSGQ (SEQ ID NO:3108) 
MSHTVQTFFSPVNSG (SEQ ID NO:3109) 
EMLKEEQEVAMLGGP (SEQ ID NO:3110) 
EMLKEEHEVAVLGGP (SEQ ID NO:3111) 
KSRDRKMVGDVTGAQ (SEQ ID NO:3112) 
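The peptides listed in Tables 8 and 9 all begin with the residues CKIE and end with LSN. The sketch below shows one way to check this and to extract the internal sequence of each peptide. The text does not state the purpose of these flanking residues, so treating them as removable flanks is an assumption made for illustration, and the names used here are hypothetical.

```python
# Inspect the shared flanking residues of the Table 8 and Table 9 peptides.
# Sequences are copied from the tables; the flank interpretation is an assumption.

PEPTIDES = {
    "KIAA 0792": [
        "CKIEQALAQTGSVAAAPQEALSN",
        "CKIELPRDARKETVESHFRDLSN",
        "CKIEDLLDKDPYSFGRTTIALSN",
    ],
    "GPR-56": [
        "CKIERGHREDFRFASQRNQTLSN",
        "CKIEHAPFPAAHPASRSFPDLSN",
        "CKIERLQARGGPSPLKSNSDLSN",
    ],
}


def core_sequence(peptide: str, prefix: str = "CKIE", suffix: str = "LSN") -> str:
    """Return the peptide without the shared flanks, or unchanged if they are absent."""
    if peptide.startswith(prefix) and peptide.endswith(suffix):
        return peptide[len(prefix):len(peptide) - len(suffix)]
    return peptide


for target, peptides in PEPTIDES.items():
    for peptide in peptides:
        print(f"{target}: {len(peptide)} residues, core {core_sequence(peptide)}")
```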

EXAMPLE 14 

Purification Of Polypeptides Using Specific Antibodies 

[0085] Native or recombinant polypeptides can be purified by a variety of 
standard techniques in the art of protein purification. For example, pro-polypeptide, 
mature polypeptide, or pre-polypeptide is purified by immunoaffinity chromatography 
using antibodies specific for the polypeptide of interest. In general, an immunoaffinity 
column is constructed by covalently coupling the antibody to an activated 
chromatographic resin. Polyclonal immunoglobulins are prepared from immune sera 
either by precipitation with ammonium sulfate or by purification on immobilized 
Protein A (Pharmacia LKB Biotechnology, Piscataway, N.J.). Likewise, monoclonal 
antibodies are prepared from mouse ascites fluid by ammonium sulfate precipitation 
or chromatography on immobilized Protein A. Partially purified immunoglobulin is 
covalently attached to a chromatographic resin such as CNBr-activated 
SEPHAROSE™ (Pharmacia LKB Biotechnology). The antibody is coupled to the 
resin, the resin is blocked, and the derivative resin is washed according to the 
manufacturer's instructions. 

[0086] Such an immunoaffinity column is utilized in the purification of 
polypeptide by preparing a fraction from cells containing polypeptide in a soluble 
form. This preparation is derived by solubilization of the whole cell or of a subcellular 
fraction obtained via differential centrifugation by the addition of detergent or by 
other methods well known in the art. Alternatively, soluble polypeptide containing a 
signal sequence can be secreted in useful quantity into the medium in which the cells 
are grown. 

A soluble polypeptide-containing preparation is passed over the immunoaffinity 
column, and the column is washed under conditions that allow the preferential 
adsorption of polypeptide (e.g., high ionic strength buffers in the presence of 
detergent). Then, the column is eluted under conditions that disrupt 
antibody/polypeptide binding (e.g., a low pH buffer such as approximately pH 2-3, or 
a high concentration of a chaotrope such as urea or thiocyanate ion), and polypeptide 
is collected. 